CORONAVIRUS 


FIELD OF THE INVENTION 


The present invention relates to an attenuated coronavirus 
comprising a variant replicase gene, which causes the virus 
to have reduced pathogenicity. The present invention also 
relates to the use of such a coronavirus in a vaccine to 
prevent and/or treat a disease. 


BACKGROUND TO THE INVENTION 


Avian infectious bronchitis virus (IBV), the aetiological 
agent of infectious bronchitis (IB), is a highly infectious and 
contagious pathogen of domestic fowl that replicates pri- 
marily in the respiratory tract but also in epithelial cells of 
the gut, kidney and oviduct. IBV is a member of the Order 
Nidovirales, Family Coronaviridae, Subfamily Coronavirinae
and Genus Gammacoronavirus; genetically very similar
coronaviruses cause disease in turkeys, guinea fowl and 
pheasants. 

Clinical signs of IB include sneezing, tracheal rales, nasal 
discharge and wheezing. Meat-type birds have reduced 
weight gain, whilst egg-laying birds lay fewer eggs and 
produce poor quality eggs. The respiratory infection predis- 
poses chickens to secondary bacterial infections which can 
be fatal in chicks. The virus can also cause permanent 
damage to the oviduct, especially in chicks, leading to 
reduced egg production and quality; and kidney, sometimes 
leading to kidney disease which can be fatal. 

IBV has been reported to be responsible for more eco- 
nomic loss to the poultry industry than any other infectious 
disease. Although live attenuated vaccines and inactivated 
vaccines are universally used in the control of IBV, the 
protection gained by use of vaccination can be lost either due 
to vaccine breakdown or the introduction of a new IBV 
serotype that is not related to the vaccine used, posing a risk 
to the poultry industry. 

Further, there is a need in the industry to develop vaccines 
which are suitable for use in ovo, in order to improve the 
efficiency and cost-effectiveness of vaccination pro- 
grammes. A major challenge associated with in ovo vacci- 
nation is that the virus must be capable of replicating in the 
presence of maternally-derived antibodies against the virus, 
without being pathogenic to the embryo. Current IBV vaccines
are derived following multiple passage in embryonated
eggs; this results in viruses with reduced pathogenicity for
chickens, so that they can be used as live attenuated vaccines.
However, such viruses almost always show
increased virulence to embryos and therefore cannot be used
for in ovo vaccination as they cause reduced hatchability. A
70% reduction in hatchability is seen in some cases.

Attenuation following multiple passage in embryonated 
eggs also suffers from other disadvantages. It is an empirical 
method, as attenuation of the viruses is random and will 
differ every time the virus is passaged, so passage of the 
same virus through a different series of eggs for attenuation 
purposes will lead to a different set of mutations leading to 
attenuation. There are also efficacy problems associated with 
the process: some mutations will affect the replication of the 
virus and some of the mutations may make the virus too 
attenuated. Mutations can also occur in the S gene which 
may also affect immunogenicity so that the desired immune 
response is affected and the potential vaccine may not 
protect against the required serotype. In addition there are
problems associated with reversion to virulence and stability 
of vaccines. 

It is important that new and safer vaccines are developed
for the control of IBV. Thus there is a need for IBV vaccines 
which are not associated with these issues, in particular 
vaccines which may be used for in ovo vaccination. 


SUMMARY OF ASPECTS OF THE INVENTION 


The present inventors have used a reverse genetics 
approach in order to rationally attenuate IBV. This approach 
is much more controllable than random attenuation follow- 
ing multiple passages in embryonated eggs because the 
position of each mutation is known and its effect on the 
virus, i.e. the reason for attenuation, can be derived. 

Using their reverse genetics approach, the present inven- 
tors have identified various mutations which cause the virus 
to have reduced levels of pathogenicity. The levels of 
pathogenicity may be reduced such that when the virus is 
administered to an embryonated egg, it is capable of repli- 
cating without being pathogenic to the embryo. Such viruses 
may be suitable for in ovo vaccination, which is a significant
advantage and an improvement over attenuated IBV vaccines
produced following multiple passage in embryonated
eggs.

Thus in a first aspect, the present invention provides a 
live, attenuated coronavirus comprising a variant replicase 
gene encoding polyproteins comprising a mutation in one or 
more of non-structural protein(s) (nsp)-10, nsp-14, nsp-15 or 
nsp-16. 

The variant replicase gene may encode a protein com- 
prising one or more amino acid mutations selected from the 
list of: 

Pro to Leu at position 85 of SEQ ID NO: 6;

Val to Leu at position 393 of SEQ ID NO: 7; 

Leu to Ile at position 183 of SEQ ID NO: 8; 

Val to Ile at position 209 of SEQ ID NO: 9. 

The replicase gene may encode a protein comprising the 
amino acid mutation Pro to Leu at position 85 of SEQ ID 
NO: 6. 

The replicase gene may encode a protein comprising the 
amino acid mutations Val to Leu at position 393 of SEQ ID 
NO: 7; Leu to Ile at position 183 of SEQ ID NO: 8; and Val 
to Ile at position 209 of SEQ ID NO: 9. 

The replicase gene may encode a protein comprising the
amino acid mutations Pro to Leu at position 85 of SEQ ID 
NO: 6; Val to Leu at position 393 of SEQ ID NO:7; Leu to 
Ile at position 183 of SEQ ID NO:8; and Val to Ile at position 
209 of SEQ ID NO: 9. 

The replicase gene may comprise one or more nucleotide 
substitutions selected from the list of: 

C to T at nucleotide position 12137; 

G to C at nucleotide position 18114; 

T to A at nucleotide position 19047; and 

G to A at nucleotide position 20139; 

compared to the sequence shown as SEQ ID NO: 1. 
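
The substitutions listed above can be illustrated with a short sketch. This is only an illustration, not the inventors' method: it assumes the listed positions are 1-based offsets into the reference sequence, and the function name `apply_substitutions` and the toy sequence are hypothetical. Applying the real list would require the full sequence of SEQ ID NO: 1.

```python
# Substitutions as listed in the text: (position, reference base, variant base).
# Assumption: positions are 1-based relative to SEQ ID NO: 1.
SUBSTITUTIONS = [
    (12137, "C", "T"),
    (18114, "G", "C"),
    (19047, "T", "A"),
    (20139, "G", "A"),
]


def apply_substitutions(reference, substitutions):
    """Return a copy of `reference` with each point substitution applied.

    Raises ValueError if a position lies outside the sequence or if the
    base found there does not match the expected reference base.
    """
    bases = list(reference.upper())
    for position, expected, replacement in substitutions:
        index = position - 1  # convert 1-based position to 0-based index
        if index < 0 or index >= len(bases):
            raise ValueError(f"position {position} is outside the sequence")
        if bases[index] != expected:
            raise ValueError(
                f"expected {expected} at position {position}, found {bases[index]}"
            )
        bases[index] = replacement
    return "".join(bases)


# Toy demonstration on a hypothetical 8-base sequence (not SEQ ID NO: 1).
toy_reference = "ACGTACGT"
toy_variant = apply_substitutions(toy_reference, [(2, "C", "T"), (7, "G", "A")])
print(toy_variant)  # ATGTACAT
```

Checking the expected reference base before each change guards against off-by-one errors in the position numbering, which is the most likely mistake when moving between a printed sequence listing and a zero-indexed string.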

The coronavirus may be an infectious bronchitis virus 
(IBV). 

The coronavirus may be IBV M41. 

The coronavirus may comprise an S protein at least part 
of which is from an IBV serotype other than M41. 

For example, the S1 subunit or the entire S protein may 
be from an IBV serotype other than M41. 

The coronavirus according to the first aspect of the 
invention has reduced pathogenicity compared to a corona- 
virus expressing a corresponding wild-type replicase, such 
that when the virus is administered to an embryonated egg, 
it is capable of replicating without being pathogenic to the 
embryo. 

In a second aspect, the present invention provides a
variant replicase gene as defined in connection with the first 
aspect of the invention. 

In a third aspect, the present invention provides a protein 
encoded by a variant coronavirus replicase gene according 
to the second aspect of the invention. 

In a fourth aspect, the present invention provides a 
plasmid comprising a replicase gene according to the second 
aspect of the invention. 

In a fifth aspect, the present invention provides a method 
for making the coronavirus according to the first aspect of 
the invention which comprises the following steps: 

(i) transfecting a plasmid according to the fourth aspect of
the invention into a host cell;

(ii) infecting the host cell with a recombining virus
comprising the genome of a coronavirus strain with a
replicase gene;

(iii) allowing homologous recombination to occur 
between the replicase gene sequences in the plasmid 
and the corresponding sequences in the recombining 
virus genome to produce a modified replicase gene; and 

(iv) selecting for recombining virus comprising the modi- 
fied replicase gene. 

The recombining virus may be a vaccinia virus. 

The method may also include the step: 

(v) recovering recombinant coronavirus comprising the 
modified replicase gene from the DNA from the recom- 
bining virus from step (iv). 

In a sixth aspect, the present invention provides a cell 
capable of producing a coronavirus according to the first 
aspect of the invention. 

In a seventh aspect, the present invention provides a 
vaccine comprising a coronavirus according to the first 
aspect of the invention and a pharmaceutically acceptable 
carrier. 

In an eighth aspect, the present invention provides a 
method for treating and/or preventing a disease in a subject 
which comprises the step of administering a vaccine accord- 
ing to the seventh aspect of the invention to the subject. 

Further aspects of the invention provide: 

the vaccine according to the seventh aspect of the inven- 
tion for use in treating and/or preventing a disease in a 
subject. 

use of a coronavirus according to the first aspect of the 
invention in the manufacture of a vaccine for treating 
and/or preventing a disease in a subject. 

The disease may be infectious bronchitis (IB). 

The method of administration of the vaccine may be
selected from the group consisting of: eye drop administration,
intranasal administration, drinking water administration,
post-hatch injection and in ovo injection.

Vaccination may be by in ovo vaccination.

The present invention also provides a method for produc- 
ing a vaccine according to the seventh aspect of the inven- 
tion, which comprises the step of infecting a cell according 
to the sixth aspect of the invention with a coronavirus 
according to the first aspect of the invention. 


DESCRIPTION OF THE FIGURES 


FIG. 1—Growth kinetics of M41-R-6 and M41-R-12 
compared to M41-CK (M41 EP4) on CK cells 

FIG. 2—Clinical signs, snicking and wheezing, associ- 
ated with M41-R-6 and M41-R-12 compared to M41-CK 
(M41 EP4) and Beau-R (Bars show mock, Beau-R, M41-R 
6, M41-R 12, M41-CK EP4 from left to right of each 
timepoint). 

FIG. 3—Ciliary activity of the viruses in tracheal rings
isolated from tracheas taken from infected chicks. 100%
ciliary activity indicates no effect by the virus (apathogenic);
0% activity indicates complete loss of ciliary activity
(complete ciliostasis), indicating the virus is pathogenic (Bars
show mock, Beau-R, M41-R 6, M41-R 12, M41-CK EP4
from left to right of each timepoint).

FIG. 4—Clinical signs, snicking, associated with M41R-
nsp10rep and M41R-nsp14,15,16rep compared to M41-R-
12 and M41-CK (M41 EP5) (Bars show mock, M41-R12;
M41R-nsp10rep; M41R-nsp14,15,16rep and M41-CK EP5
from left to right of each timepoint).

FIG. 5—Ciliary activity of M41R-nsp10rep and M41R-
nsp14,15,16rep compared to M41-R-12 and M41-CK in
tracheal rings isolated from tracheas taken from infected
chicks (Bars show mock; M41-R12; M41R-nsp10rep;
M41R-nsp14,15,16rep and M41-CK EP5 from left to right
of each timepoint).

FIG. 6—Clinical signs, snicking, associated with M41R-
nsp10, 15rep, M41R-nsp10, 14, 15rep, M41R-nsp10, 14,
16rep, M41R-nsp10, 15, 16rep and M41-K compared to
M41-CK (Bars show mock, M41R-nsp10,15rep1; M41R-
nsp10,14,16rep4; M41R-nsp10,15,16rep8; M41R-nsp10,14,
15rep10; M41-K6 and M41-CK EP4 from left to right of
each timepoint).

FIG. 7—Clinical signs, wheezing, associated with M41R-
nsp10, 15rep, M41R-nsp10, 14, 15rep, M41R-nsp10, 14,
16rep, M41R-nsp10, 15, 16rep and M41-K compared to
M41-CK (Bars show mock, M41R-nsp10,15rep1; M41R-
nsp10,14,16rep4; M41R-nsp10,15,16rep8; M41R-nsp10,14,
15rep10; M41-K6 and M41-CK EP4 from left to right of
each timepoint).

FIG. 8—Ciliary activity of M41R-nsp10, 15rep, M41R-
nsp10, 14, 15rep, M41R-nsp10, 14, 16rep, M41R-nsp10, 15,
16rep and M41-K compared to M41-CK in tracheal rings
isolated from tracheas taken from infected chicks (Bars
show mock, M41R-nsp10,15rep1; M41R-nsp10,14,16rep4;
M41R-nsp10,15,16rep8; M41R-nsp10,14,15rep10; M41-
K6 and M41-CK EP4 from left to right of each timepoint).

FIG. 9—Growth kinetics of rIBVs compared to M41-CK
on CK cells. FIG. 9A shows the results for M41-R and
M41-K. FIG. 9B shows the results for M41-nsp10 rep;
M41R-nsp14, 15, 16 rep; M41R-nsp10, 15 rep; M41R-
nsp10, 15, 16 rep; M41R-nsp10, 14, 15 rep; and M41R-
nsp10, 14, 16.

FIG. 10—Position of amino acid mutations in mutated 
nsp10, nsp14, nsp15 and nsp16 sequences. 

FIG. 11—A) Snicking; B) Respiratory symptoms (wheez- 
ing and rales combined) and C) Ciliary activity of rIBV 
M41R-nsp 10,14 rep and rIBV M41R-nsp 10,16 rep com- 
pared to M41-CK (Bars show mock, M41R-nsp10,14rep; 
M41R-nsp10,16rep and M41-K from left to right of each 
timepoint). 


DETAILED DESCRIPTION 


The present invention provides a coronavirus comprising 
a variant replicase gene which, when expressed in the 
coronavirus, causes the virus to have reduced pathogenicity 
compared to a corresponding coronavirus which comprises 
the wild-type replicase gene. 

Coronavirus 

Gammacoronavirus is a genus of animal virus belonging 
to the family Coronaviridae. Coronaviruses are enveloped 
viruses with a positive-sense single-stranded RNA genome 
and a helical symmetry. 

The genomic size of coronaviruses ranges from approximately
27 to 32 kilobases, which is the largest for any
known RNA virus.

Coronaviruses primarily infect the upper respiratory or 
gastrointestinal tract of mammals and birds. Five to six 
different currently known strains of coronaviruses infect 
humans. The most publicized human coronavirus, SARS- 
CoV which causes severe acute respiratory syndrome 
(SARS), has a unique pathogenesis because it causes both 
upper and lower respiratory tract infections and can also 
cause gastroenteritis. Middle East respiratory syndrome
coronavirus (MERS-CoV) also causes a lower respiratory 
tract infection in humans. Coronaviruses are believed to 
cause a significant percentage of all common colds in human 
adults. 

Coronaviruses also cause a range of diseases in livestock 
animals and domesticated pets, some of which can be 
serious and are a threat to the farming industry. Economi- 
cally significant coronaviruses of livestock animals include 
infectious bronchitis virus (IBV) which mainly causes respi- 
ratory disease in chickens and seriously affects the poultry 
industry worldwide; porcine coronavirus (transmissible gas- 
troenteritis, TGE) and bovine coronavirus, which both result 
in diarrhoea in young animals. Feline coronavirus has two
forms: feline enteric coronavirus is a pathogen of minor
clinical significance, but spontaneous mutation of this virus 
can result in feline infectious peritonitis (FIP), a disease 
associated with high mortality. 

There are also two types of canine coronavirus (CCoV), 
one that causes mild gastrointestinal disease and one that has 
been found to cause respiratory disease. Mouse hepatitis 
virus (MHV) is a coronavirus that causes an epidemic 
murine illness with high mortality, especially among colo- 
nies of laboratory mice. 

Coronaviruses are divided into four groups, as shown 
below: 

Alpha 

Canine coronavirus (CCoV) 

Feline coronavirus (FeCoV) 

Human coronavirus 229E (HCoV-229E) 

Porcine epidemic diarrhoea virus (PEDV) 

Transmissible gastroenteritis virus (TGEV) 

Human Coronavirus NL63 (NL or New Haven) 

Beta 

Bovine coronavirus (BCoV) 

Canine respiratory coronavirus (CRCoV)—Common 
in SE Asia and Micronesia 

Human coronavirus OC43 (HCoV-OC43) 

Mouse hepatitis virus (MHV) 

Porcine haemagglutinating encephalomyelitis virus 
(HEV) 

Rat coronavirus (RCV). Rat coronavirus is quite prevalent
in Eastern Australia where, as of March/April
2008, it has been found among native and feral
rodent colonies.

(No common name as of yet) (HCoV-HKU1)

Severe acute respiratory syndrome coronavirus 
(SARS-CoV) 

Middle East respiratory syndrome coronavirus (MERS- 
CoV) 

Gamma 

Infectious bronchitis virus (IBV) 

Turkey coronavirus (Bluecomb disease virus) 

Pheasant coronavirus 

Guinea fowl coronavirus 

Delta

Bulbul coronavirus (BuCoV)

Thrush coronavirus (ThCoV)

Munia coronavirus (MuCoV)

Porcine coronavirus (PorCov) HKU15

The variant replicase gene of the coronavirus of the 
present invention may be derived from an alphacoronavirus 
such as TGEV; a betacoronavirus such as MHV; or a 
gammacoronavirus such as IBV. 

As used herein the term “derived from” means that the 
replicase gene comprises substantially the same nucleotide 
sequence as the wild-type replicase gene of the relevant
coronavirus. For example, the variant replicase gene of the
present invention may have up to 80%, 85%, 90%, 95%,
98% or 99% identity with the wild type replicase sequence.
The variant coronavirus replicase gene encodes a protein
comprising a mutation in one or more of non-structural
protein (nsp)-10, nsp-14, nsp-15 or nsp-16 when compared
to the wild-type sequence of the non-structural protein.
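
The notion of percentage identity used above can be made concrete with a minimal sketch. The text does not specify how identity is calculated, so this example rests on assumptions: the two sequences are already aligned, have equal length and contain no gaps, and identity is the fraction of positions with the same base. The function name `percent_identity` and the example sequences are hypothetical.

```python
def percent_identity(seq_a, seq_b):
    """Percentage of positions at which two pre-aligned sequences match.

    Assumption: both sequences are gap-free and of equal length; a real
    comparison of replicase genes would first need a sequence alignment.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must have the same length")
    if not seq_a:
        raise ValueError("sequences must not be empty")
    matches = sum(1 for a, b in zip(seq_a.upper(), seq_b.upper()) if a == b)
    return 100.0 * matches / len(seq_a)


# Two hypothetical 10-base sequences differing at one position.
print(percent_identity("ACGTACGTAC", "ACGTACGTAA"))  # 90.0
```

For sequences of different lengths, or where insertions and deletions are expected, an alignment step would be needed before counting matches; that step is outside the scope of this sketch.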

IBV 

Avian infectious bronchitis (IB) is an acute and highly 
contagious respiratory disease of chickens which causes 
significant economic losses. The disease is characterized by 
respiratory signs including gasping, coughing, sneezing, 
tracheal rales, and nasal discharge. In young chickens, 
severe respiratory distress may occur. In layers, respiratory 
distress, nephritis, decrease in egg production, and loss of 
internal egg quality and egg shell quality are common.

In broilers, coughing and rattling are common clinical 
signs, rapidly spreading in all the birds of the premises. 
Morbidity is 100% in non-vaccinated flocks. Mortality var- 
ies depending on age, virus strain, and secondary infections 
but may be up to 60% in non-vaccinated flocks. 

The first IBV serotype to be identified was Massachusetts, 
but in the United States several serotypes, including Arkan- 
sas and Delaware, are currently circulating, in addition to the 
originally identified Massachusetts type. 

The IBV strain Beaudette was derived following at least 
150 passages in chick embryos. IBV Beaudette is no longer 
pathogenic for hatched chickens but rapidly kills embryos. 

H120 is a commercial live attenuated IBV Massachusetts 
serotype vaccine strain, attenuated by approximately 120 
passages in embryonated chicken eggs. H52 is another 
Massachusetts vaccine, and represents an earlier and slightly 
more pathogenic passage virus (passage 52) during the 
development of H120. Vaccines based on H120 are com- 
monly used. 

IB QX is a virulent field isolate of IBV. It is sometimes
known as “Chinese QX” as it was originally isolated following
outbreaks of disease in the Qingdao region in China
in the mid 1990s. Since that time the virus has crept towards 
Europe. From 2004, severe egg production issues have been 
identified with a very similar virus in parts of Western 
Europe, predominantly in the Netherlands, but also reported 
from Germany, France, Belgium, Denmark and in the UK. 

The virus isolated from the Dutch cases was identified by 
the Dutch Research Institute at Deventer as a new strain that 
they called D388. The Chinese connection came from fur- 
ther tests which showed that the virus was 99% similar to the 
Chinese QX viruses. A live attenuated QX-like IBV vaccine 
strain has now been developed. 

IBV is an enveloped virus that replicates in the cell
cytoplasm and contains a non-segmented, single-stranded,
positive sense RNA genome. IBV has a 27.6 kb RNA
genome and like all coronaviruses contains the four structural
proteins: spike glycoprotein (S), small membrane protein (E),
integral membrane protein (M) and nucleocapsid
protein (N), which interacts with the genomic RNA.


The genome is organised in the following manner:
5'UTR—polymerase (replicase) gene—structural protein
genes (S-E-M-N)—UTR 3'; where the UTR are untranslated
regions (each ~500 nucleotides in IBV).


The lipid envelope contains three membrane proteins: S, 
M and E. The IBV S protein is a type I glycoprotein which 
oligomerizes in the endoplasmic reticulum and is assembled 
into a homotrimer inserted in the virion membrane via the
transmembrane domain and is associated through non-co- 
valent interactions with the M protein. Following incorpo- 
ration into coronavirus particles, the S protein is responsible 
for binding to the target cell receptor and fusion of the viral 
and cellular membranes. The S glycoprotein consists of four 
domains: a signal sequence that is cleaved during synthesis; 
the ectodomain, which is present on the outside of the virion 
particle; the transmembrane region responsible for anchor- 
ing the S protein into the lipid bilayer of the virion particle; 
and the cytoplasmic tail. 


All coronaviruses also encode a set of accessory protein
genes of unknown function that are not required for replication
in vitro, but may play a role in pathogenesis. IBV
encodes two accessory genes, genes 3 and 5, which both
express two accessory proteins 3a, 3b and 5a, 5b, respectively.

The variant replicase gene of the coronavirus of the 
present invention may be derived from an IBV. For example 
the IBV may be IBV Beaudette, H120, H52, IB QX, D388 
or M41. 

The IBV may be IBV M41. M41 is a prototypic Massa- 
chusetts serotype that was isolated in the USA in 1941. It is 
an isolate used in many labs throughout the world as a 
pathogenic lab strain and can be obtained from ATCC (VR-21™).
Attenuated variants are also used by several vaccine
producers as IBV vaccines against Massachusetts serotypes 
causing problems in the field. The present inventors chose to 
use this strain as they had worked for many years on this 
virus, and because the sequence of the complete virus 
genome is available. The M41 isolate, M41-CK, used by the 
present inventors was adapted to grow in primary chick 
kidney (CK) cells and was therefore deemed amenable for 
recovery as an infectious virus from a cDNA of the complete
genome. It is representative of a pathogenic IBV and there- 
fore can be analysed for mutations that cause either loss or 
reduction in pathogenicity. 

The genome sequence of IBV M41-CK is provided as 
SEQ ID NO: 1. 


IBV M41-CK Sequence 
SEQ ID NO: 1 
ACTTAAGATAGATATTAATATATATCTATCACACTAGCCTTGCGCTAGATTTCCAACTTA 
ACAAAACGGACTTAAATACCTACAGCTGGTCCTCATAGGTGTTCCATTGCAGTGCACTTT 
AGTGCCCTGGATGGCACCTGGCCACCTGTCAGGTTTTTGTTATTAAAATCTTATTGTTGC 
TGGTATCACTGCTTGTTTTGCCGTGTCTCACTTTATACATCCGTTGCTTGGGCTACCTAG 
TATCCAGCGTCCTACGGGCGCCGTGGCTGGTTCGAGTGCGAAGAACCTCTGGTTCATCTA 
GCGGTAGGCGGGTGTGTGGAAGTAGCACTTCAGACGTACCGGTTCTGTTGTGTGAAATAC 
GGGGTCACCTCCCCCCACATACCTCTAAGGGCTTTTGAGCCTAGCGTTGGGCTACGTTCT 
CGCATAAGGTCGGCTATACGACGTTTGTAGGGGGTAGTGCCAAACAACCCCTGAGGTGAC 
AGGTTCTGGTGGTGTTTAGTGAGCAGACATACAATAGACAGTGACAACATGGCTTCAAGC 
CTAAAACAGGGAGTATCTGCGAAACTAAGGGATGTCATTGTTGTATCCAAAGAGATTGCT 
GAACAACTTTGTGACGCTTTGTTTTTCTATACGTCACACAACCCTAAGGATTACGCTGAT 
GCTTTTGCAGTTAGGCAGAAGTTTGATCGTAATCTGCAGACTGGGAAACAGTTCAAATTT 
GAAACTGTGTGTGGTCTCTTCCTCTTGAAGGGAGTTGACAAAATAACACCTGGCGTCCCA 
GCAAAAGTCTTAAAAGCCACTTCTAAGTTGGCAGATTTAGAAGACATCTTTGGTGTCTCT 
CCCTTTGCAAGAAAATATCGTGAACTTTTGAAGACAGCATGCCAGTGGTCTCTTACTGTA 
GAAACACTGGATGCTCGTGCACAAACTCTTGATGAAATTTTTGACCCTACTGAAATACTT 
TGGCTTCAGGTGGCAGCAAAAATCCAAGTTTCGGCTATGGCGATGCGCAGGCTTGTTGGA 
GAAGTAACTGCAAAAGTCATGGATGCTTTGGGCTCAAATATGAGTGCTCTTTTCCAGATT 


TTTAAACAACAAATAGTCAGAATTTTTCAAAAAGCGCTGGCTATTTTTGAGAATGTGAGT 





GAATTACCACAGCGTATTGCAGCACTTAAGATGGCTTTTGCTAAGTGTGCCAAGTCCATT 





ACTGTTGTGGTTATGGAGAGGACTCTAGTTGTTAGAGAGTTCGCAGGAACTTGTCTTGCA 





AGCATTAATGGTGCTGTTGCAAAATTCTTTGAAGAACTCCCAAATGGTTTCATGGGTGCT 





AAAATTTTCACTACACTTGCCTTCTTTAGGGAGGCTGCAGTGAAAATTGTGGATAACATA 





CCAAATGCACCGAGAGGCACTAAAGGGTTTGAAGTCGTTGGTAATGCCAAAGGTACACAA 
GTTGTTGTGCGTGGCATGGGAAATGACTTAACACTGGTTGAGCAAAAAGCTGAAATTGCT 


GTGGAGTCAGAAGGTTGGTCTGCAATTTTGGGTGGACATCTTTGCTATGTCTTTAAGAGT 
GGTGATCGCTTTTACGCGGCACCTCTTTCAGGAAATTTTGCATTGCATGATGTGCATTGT 
TGTGAGCGTGTTGTCTGTCTTTCTGATGGTGTAACACCGGAGATAAATGATGGACTTATT 
CTTGCAGCAATCTACTCTTCTTTTAGTGTCGCAGAACTTGTGGCAGCCATTAAAAGGGGT 
GAACCATTTAAGTTTCTGGGTCATAAATTTGTGTATGCAAAGGATGCAGCAGTTTCTTTT 
ACATTAGCGAAGGCTGCTACTATTGCAGATGTTTTGAAGCTGTTTCAATCAGCGCGTGTG 
AAAGTAGAAGATGTTTGGTCTTCACTTACTGAAAAGTCTTTTGAATTCTGGAGGCTTGCA 
TATGGAAAAGTGCGTAATCTCGAAGAATTTGTTAAGACTTGTTTTTGTAAGGCTCAAATG 
GCGATTGTGATTTTAGCGACAGTGCTTGGAGAGGGCATTTGGCATCTTGTTTCGCAAGTC 
ATCTATAAAGTAGGTGGTCTTTTTACTAAAGTTGTTGACTTTTGTGAAAAATATTGGAAA 
GGTTTTTGTGCACAGTTGAAAAGAGCTAAGCTCATTGTCACTGAAACCCTCTGTGTTTTG 
AAAGGAGTTGCACAGCATTGTTTTCAACTATTGCTGGATGCAATACAGTTTATGTATAAA 
AGTTTTAAGAAGTGTGCACTTGGTAGAATCCATGGAGACTTGCTCTTCTGGAAAGGAGGT
GTGCACAAAATTATTCAAGAGGGCGATGAAATTTGGTTTGAGGGCATTGATAGTATTGAT 
GTTGAAGATCTGGGTGTTGTTCAAGAAAAATTGATTGATTTTGATGTTTGTGATAATGTG 
ACACTTCCAGAGAACCAACCCGGTCATATGGTTCAAATCGAGGATGACGGAAAGAACTAC 
ATGTTCTTCCGCTTCAAAAAGGATGAGAACATTTATTATACACCAATGTCACAGCTTGGT 
GCTATTAATGTGGTTTGCAAAGCAGGCGGTAAAACTGTCACCTTTGGAGAAACTACTGTG 
CAAGAAATACCACCACCTGATGTTGTGTTTATTAAGGTTAGCATTGAGTGTTGTGGTGAA 
CCATGGAATACAATCTTCAAAAAGGCTTATAAGGAGCCCATTGAAGTAGAGACAGACCTC 
ACAGTTGAACAATTGCTCTCTGTGGTCTATGAGAAAATGTGTGATGATCTCAAGCTGTTT 
CCGGAGGCTCCAGAACCACCACCATTTGAGAATGTCACACTTGTTGATAAGAATGGTAAA 
GATTTGGATTGCATAAAATCATGCCATCTGATCTATCGTGATTATGAGAGCGATGATGAC 
ATCGAGGAAGAAGATGCAGAAGAATGTGACACGGATTCAGGTGATGCTGAGGAGTGTGAC 
ACTAATTCAGAATGTGAAGAAGAAGATGAGGATACTAAAGTGTTGGCTCTTATACAAGAC 
CCGGCAAGTAACAAATATCCTCTGCCTCTTGATGATGATTATAGCGTCTACAATGGATGT 
ATTGTTCATAAGGACGCTCTCGATGTTGTGAATTTACCATCTGGTGAAGAAACCTTTGTT 
GTCAATAACTGCTTTGAAGGGGCTGTTAAAGCTCTTCCGCAGAAAGTTATTGATGTTCTA 
GGTGACTGGGGTGAGGCTGTTGATGCGCAAGAACAATTGTGTCAACAAGAATCAACTCGG 
GTCATATCTGAGAAATCAGTTGAGGGTTTTACTGGTAGTTGTGATGCAATGGCTGAACAA 
GCTATTGTTGAAGAGCAGGAAATAGTACCTGTTGTTGAACAAAGTCAGGATGTAGTTGTT 
TTTACACCTGCAGACCTAGAAGTTGTTAAAGAAACAGCAGAAGAGGTTGATGAGTTTATT 
CTCATTTCTGCTGTCCCTAAAGAAGAAGTTGTGTCTCAGGAGAAAGAGGAGCCACAGGTT 
GAGCAAGAGCCTACCCTAGTTGTTAAAGCACAACGTGAGAAGAAGGCTAAAAAGTTCAAA 


GTTAAACCAGCTACATGTGAAAAACCCAAATTTTTGGAGTACAAAACATGTGTGGGTGAT 





TTGGCTGTTGTAATTGCCAAAGCATTGGATGAGTTTAAAGAGTTCTGCATTGTAAACGCT 


GCAAATGAGCACATGTCGCATGGTGGTGGCGTTGCAAAGGCAATTGCAGACTTTTGTGGA 


CCGGACTTTGTTGAATATTGCGCGGACTATGTTAAGAAACATGGTCCACAGCAAAAACTT 


GTCACACCTTCATTTGTTAAAGGCATTCAATGTGTGAATAATGTTGTAGGACCTCGCCAT 


GGAGACAGCAACTTGCGTGAGAAGCTTGTTGCTGCTTACAAGAGTGTTCTTGTAGGTGGA 


GTGGTTAACTATGTTGTGCCAGTTCTCTCATCAGGGATTTTTGGTGTAGATTTTAAAATA 
TCAATAGATGCTATGCGCGAAGCTTTTAAAGGTTGTGCCATACGCGTTCTTTTATTTTCT 
CTGAGTCAAGAACACATCGATTATTTCGATGCAACTTGTAAGCAGAAGACAATTTATCTT 
ACGGAGGATGGTGTTAAATACCGCTCTGTTGTTTTAAAACCTGGTGATTCTTTGGGTCAA 


TTTGGACAGGTTTTTGCAAGAAATAAGGTAGTCTTTTCGGCTGATGATGTTGAGGATAAA 





GAAATCCTCTTTATACCCACAACTGACAAGACTATTCTTGAATATTATGGTTTAGATGCG 


CAAAAGTATGTAACATATTTGCAAACGCTTGCGCAGARATGGGATGTTCAATATAGAGAC 


AATTTTGTTATATTAGAGTGGCGTGACGGAAATTGCTGGATTAGTTCAGCAATAGTTCTC 


CTTCAAGCTGCTAAAATTAGATTTAAAGGTTTTCTTGCAGAAGCATGGGCTAAACTGTTG 


GGTGGAGATCCTACAGACTTTGTTGCCTGGTGTTATGCAAGTTGCAATGCTAAAGTAGGT 


GATTTTTCAGATGCTAATTGGCTTTTGGCCAATTTAGCAGAACATTTTGACGCAGATTAC 


ACAAATGCACTTCTTAAGAAGTGTGTGTCGTGCAATTGTGGTGTTAAGAGTTATGAACTT 


AGGGGTCTTGAAGCCTGTATTCAGCCAGTTCGAGCACCTAATCTTCTACATTTTAAAACG 


CAATATTCAAATTGCCCAACCTGTGGTGCAAGTAGTACGGATGAAGTAATAGAAGCTTCA 


TTACCGTACTTATTGCTTTTTGCTACTGATGGTCCTGCTACAGTTGATTGTGATGAAAAT 


GCTGTAGGGACTGTTGTTTTCATTGGCTCTACTAATAGTGGCCATTGTTATACACAAGCC 


GATGGTAAGGCTTTTGACAATCTTGCTAAGGATAGAAAATTTGGAAGGAAGTCGCCTTAC 


ATTACAGCAATGTATACACGTTTTTCTCTTAGGAGTGAAAATCCCCTACTTGTTGTTGAA 


CATAGTAAGGGTAAAGCTAAAGTAGTAAAAGAAGATGTTTCTAACCTTGCTACTAGTTCT 


AAAGCCAGTTTTGACGATCTTACTGACTTTGAACACTGGTATGATAGCAACATCTATGAG 


AGTCTTAAAGTGCAGGAGACACCTGATAATCTTGATGAATATGTGTCATTTACGACAAAG 


GAAGATTCTAAGTTGCCACTGACACTTAAAGTTAGAGGTATCAAATCAGTTGTTGACTTT 


AGGTCTAAGGATGGTTTTACTTATAAGTTAACACCTGATACTGATGAAAATTCAAAAACA 


CCAGTCTACTACCCAGTCTTGGATTCTATTAGTCTTAGGGCAATATGGGTTGAAGGCAGT 


GCTAATTTTGTTGTTGGGCATCCAAATTATTATAGTAAGTCTCTCCGAATTCCCACGTTT 


TGGGAAAATGCCGAGAGCTTTGTTAAAATGGGTTATAAAATTGATGGTGTAACTATGGGC 


CTTTGGCGTGCAGAACACCTTAATAAACCTAATTTGGAGAGAATTTTTAACATTGCTAAG 


AAAGCTATTGTTGGATCTAGTGTTGTTACTACGCAGTGTGGTAAAATACTAGTTAAAGCA 


GCTACATACGTTGCCGATAAAGTAGGTGATGGTGTAGTTCGCAATATTACAGATAGAATT 


AAGGGTCTTTGTGGATTCACACGTGGCCATTTTGAAAAGAAAATGTCCCTACAATTTCTA 


AAGACACTTGTGTTCTTTTTCTTTTATTTCTTAAAGGCTAGTGCTAAGAGTTTAGTTTCT 


AGCTATAAGATTGTGTTATGTAAGGTGGTGTTTGCTACCTTACTTATAGTGTGGTTTATA 


TACACAAGTAATCCAGTAGTGTTTACTGGAATACGTGTGCTAGACTTCCTATTTGAAGGT 


TCTTTATGTGGTCCTTATAATGACTACGGTAAAGATTCTTTTGATGTGTTACGGTATTGT 


GCAGGTGATTTTACTTGTCGTGTGTGTTTACATGATAGAGATTCACTTCATCTGTACAAA 


CATGCTTATAGCGTAGAACAAATTTATAAGGATGCAGCTTCTGGCATTAACTTTAATTGG 


AATTGGCTTTATTTGGTCTTTCTAATATTATTTGTTAAGCCAGTGGCAGGTTTTGTTATT 





ATTTGTTATTGTGTTAAGTATTTGGTATTGAGTTCAACTGTGTTGCAAACTGGTGTAGGT 





TTTCTAGATTGGTTTGTAAAAACAGTTTTTACCCATTTTAATTTTATGGGAGCGGGATTT 








TATTTCTGGCTCTTTTACAAGATATACGTACAAGTGCATCATATATTGTACTGTAAGGAT 
GTAACATGTGAAGTGTGCAAGAGAGTTGCACGCAGCAACAGGCAAGAGGTTAGCGTTGTA 


GTTGGTGGACGCAAGCAAATAGTGCATGTTTACACTAATTCTGGCTATAACTTTTGTAAG 
AGACATAATTGGTATTGTAGAAATTGTGATGATTATGGTCACCAAAATACATTTATGTCC 
CCTGAAGTTGCTGGCGAGCTTTCTGAAAAGCTTAAGCGCCATGTTAAACCTACAGCATAT 


GCTTACCACGTTGTGTATGAGGCATGCGTGGTTGATGATTTTGTTAATTTAAAATATAAG 





GCTGCAATTGCTGGTAAGGATAATGCATCTTCTGCTGTTAAGTGTTTCAGTGTTACAGAT 


TTTTTAAAGAAAGCTGTTTTTCTTAAGGAGGCATTGAAATGTGAACAAATATCTAATGAT 





GGTTTTATAGTGTGTAATACACAGAGTGCGCATGCACTAGAGGAAGCAAAGAATGCAGCC 


GTCTATTATGCGCAATATCTGTGTAAGCCAATACTTATACTTGACCAGGCACTTTATGAG 


CAATTAATAGTAGAGCCTGTGTCTAAGAGTGTTATAGATAAAGTGTGTAGCATTTTGTCT 


AATATAATATCTGTAGATACTGCAGCTTTAAATTATAAGGCAGGCACACTTCGTGATGCT 


CTGCTTTCTATTACTAAAGACGAAGAAGCCGTAGATATGGCTATCTTCTGCCACAATCAT 


GAAGTGGAATACACTGGTGACGGTTTTACTAATGTGATACCGTCATATGGTATGGACACT 


GATAAGTTGACACCTCGTGATAGAGGGTTTTTGATAAATGCAGATGCTTCTATTGCTAAT 


TTAAGAGTCAAAAATGCTCCTCCGGTAGTATGGAAGTTTTCTGATCTTATTAAATTGTCT 


GACAGTTGCCTTAAATATTTAATTTCAGCTACTGTCAAGTCAGGAGGTCGTTTCTTTATA 


ACAAAGTCTGGTGCTAAACAAGTTATTTCTTGTCATACCCAGAAACTGTTGGTAGAGAAA 


AAGGCAGGTGGTGTTATTAATAACACTTTTAAATGGTTTATGAGTTGTTTTAAATGGCTT 


TTTGTCTTTTATATACTTTTTACAGCATGTTGTTTGGGTTACTACTATATGGAGATGAAT 


AAAAGTTTTGTTCACCCCATGTATGATGTAAACTCCACACTGCATGTTGAAGGGTTCAAA 


GTTATAGACAAAGGTGTTATTAGAGAGATTGTGTCAGAAGATAATTGTTTCTCTAATAAG 


TTTGTTAATTTTGACGCCTTTTGGGGTAAATCATATGAAAATAATAAAAACTGTCCAATT 


GTTACAGTTGTTATAGATGGTGACGGGACAGTAGCTGTTGGTGTTCCTGGTTTTGTATCA 


TGGGTTATGGATGGTGTTATGTTTGTGCATATGACACAGACTGATCGTAGACCTTGGTAC 


ATTCCTACCTGGTTTAATAGAGAAATTGTTGGTTACACTCAGGATTCAATTATCACTGAG 


GGTAGTTTTTATACATCTATAGCATTATTTTCTGCTAGATGTTTATATTTAACAGCCAGC 


AATACACCTCAATTGTATTGTTTTAATGGCGACAATGATGCACCTGGAGCCTTACCATTT 


GGTAGTATTATTCCTCATAGAGTATACTTCCAACCTAATGGTGTTAGGCTTATAGTTCCA 


CAACAAATACTGCATACACCCTACATAGTGAAGTTTGTTTCAGACAGCTATTGTAGAGGT 


AGTGTATGTGAGTATACTAAACCAGGTTACTGTGTGTCACTAGACTCCCAATGGGTTTTG 


TTTAATGATGAATACATTAGTAAACCTGGCGTTTTCTGTGGTTCTACTGTTAGAGAACTT 


ATGTTTAATATGGTTAGTACATTCTTTACTGGTGTCAACCCTAATATTTATATTCAGCTA 


GCAACTATGTTTTTAATACTAGTTGTTATTGTGTTAATTTTTGCAATGGTTATAAAGTTT 


CAAGGTGTTTTTAAAGCTTATGCGACCATTGTGTTTACAATAATGTTAGTTTGGGTTATT 


AATGCATTTGTTTTGTGTGTACATAGTTATAATAGTGTTTTAGCTGTTATATTATTAGTA 


CTCTATTGCTATGCATCATTGGTTACAAGTCGCAATACTGCTATAATAATGCATTGTTGG 


CTTGTTTTTACCTTTGGTTTAATAGTACCCACATGGTTGGCTTGTTGCTATCTGGGATTT 





ATTCTTTATATGTACACACCGTTGGTTTTCTGGTGTTACGGTACTACTAAAAATACTCGT 


AAGTTGTATGATGGCAACGAGTTTGTTGGTAATTATGACCTTGCTGCGAAGAGCACTTTT 


GTTATTCGTGGTACTGAATTTGTTAAGCTTACGAATGAGATAGGTGATAAATTTGAAGCC 














TATCTTTCTGCGTATGCTAGACTTAAATACTATTCAGGCACTGGTAGTGAGCAAGATTAC 


TTGCAAGCTTGTCGTGCATGGTTAGCTTATGCTTTGGACCAATATAGAAATAGTGGTGTT 
GAGGTTGTTTATACCCCACCGCGTTACTCTATTGGTGTTAGTAGACTACACGCTGGTTTT 
AAAAAACTAGTTTCTCCTAGTAGTGCTGTTGAGAAGTGCATTGTTAGTGTCTCTTATAGA 
GGCAATAATCTTAATGGACTGTGGCTGGGTGATTCTATTTACTGCCCACGCCATGTGTTA 


GGTAAGTTTAGTGGTGACCAGTGGGGTGACGTACTAAACCTTGCTAATAATCATGAGTTT 





GAAGTTGTAACTCAAAATGGTGTTACTTTGAATGTTGTCAGCAGGCGGCTTAAAGGAGCA 





GTTTTAATTTTACAAACTGCAGTTGCCAATGCTGAAACTCCTAAGTATAAGTTTGTTAAA 


GCTAATTGTGGTGATAGTTTCACTATAGCTTGTTCTTATGGTGGTACAGTTATAGGACTT 


TACCCTGTCACTATGCGTTCTAATGGTACTATTAGAGCATCTTTCCTAGCAGGAGCCTGT 


GGCTCAGTTGGTTTTAATATAGAAAAGGGTGTAGTTAATTTCTTTTATATGCACCATCTT 


GAGTTACCTAATGCATTACACACTGGAACTGACCTAATGGGTGAGTTTTATGGTGGTTAT 


GTAGATGAAGAGGTTGCGCAAAGAGTGCCACCAGATAATCTAGTTACTAACAATATTGTA 


GCATGGCTCTATGGGGCAATTATTAGTGTTAAAGAAAGTAGTTTTTCACAACCTAAATGG 


TTGGAGAGTACTACTGTTTCTATTGAAGATTACAATAGGTGGGCTAGTGATAATGGTTTT 


ACTCCATTTTCCACTAGTACTGCTATTACTAAATTAAGTGCTATAACTGGGGTTGATGTT 


TGTAAACTCCTTCGCACTATTATGGTAAAAAGTGCTCAATGGGGTAGTGATCCCATTTTA 


GGACAATATAATTTTGAAGACGAATTGACACCAGAATCTGTATTTAATCAAGTTGGTGGT 


GTTAGGTTACAGTCTTCTTTTGTAAGAAAAGCTACATCTTGGTTTTGGAGTAGATGTGTA 


TTAGCTTGCTTCTTGTTTGTGTTGTGTGCTATTGTCTTATTTACGGCAGTGCCACTTAAG 


TTTTATGTACATGCAGCTGTTATTTTGTTGATGGCTGTGCTCTTTATTTCTTTTACTGTT 


AAACATGTTATGGCATACATGGACACTTTCCTATTGCCTACATTGATTACAGTTATTATT 


GGAGTTTGTGCTGAAGTCCCTTTCATATACAATACTCTAATTAGTCAAGTTGTTATTTTC 


TTAAGCCAATGGTATGATCCTGTAGTCTTTGATACTATGGTACCATGGATGTTATTGCCA 


TTAGTGTTGTACACTGCTTTTAAGTGTGTACAAGGCTGCTATATGAATTCTTTCAATACT 


TCTTTGTTAATGCTGTATCAGTTTATGAAGTTAGGTTTTGTTATTTACACCTCTTGAAAC 


ACTCTTACTGCATATACAGAAGGTAATTGGGAGTTATTCTTTGAGTTGGTTCACACTATT 


GTGTTGGCTAATGTTAGTAGTAATTCCTTAATTGGTTTAATTGTTTTTAAGTGTGCTAAG 


TGGATTTTATATTATTGCAATGCAACATACTTTAATAATTATGTGTTAATGGCAGTCATG 


GTTAATGGCATAGGCTGGCTTTGCACCTGTTACTTTGGATTGTATTGGTGGGTTAATAAA 


GTTTTTGGTTTAACCTTAGGTAAATACAATTTTAAAGTTTCAGTAGATCAATATAGGTAT 


ATGTGTTTGCATAAGGTAAATCCACCTAAAACTGTGTGGGAGGTCTTTACTACAAATATA 


CTTATACAAGGAATTGGAGGCGATCGTGTGTTGCCTATAGCTACAGTGCAATCTAAATTG 


AGTGATGTAAAGTGTACAACTGTTGTTTTAATGCAGCTTTTGACTAAGCTTAATGTTGAA 


GCAAATTCAAAAATGCATGCTTATCTTGTTGAGTTACACAATAAAATCCTCGCATCTGAT 


GATGTTGGAGAGTGCATGGATAATTTATTGGGTATGCTTATAACACTATTTTGTATAGAT 


TCTACTATTGATTTGGGTGAGTATTGTGATGATATACTTAAGAGGTCAACTGTATTACAA 


TCGGTTACTCAAGAGTTTTCGCACATACCCTCGTATGCTGAATATGAAAGAGCTAAGAGT 





ATTTATGAAAAGGTTTTAGCCGATTCTAAAAATGGTGGTGTAACACAGCAAGAGCTTGCT 





GCATATCGTAAAGCTGCCAATATTGCAAAGTCAGTTTTTGATAGAGACTTGGCTGTTCAA 


AAGAAGTTAGATAGCATGGCAGAACGTGCTATGACAACAATGTATAAAGAGGCGCGTGTA 
ACTGATAGAAGAGCAAAATTAGTTTCATCATTACATGCACTACTTTTTTCAATGCTTAAG


AAAATAGATTCTGAGAAGCTTAATGTCTTATTTGACCAGGCGAATAGTGGTGTTGTACCC 
CTAGCAACTGTTCCAATTGTTTGTAGTAATAAGCTTACCCTTGTTATACCAGACCCAGAG 
ACGTGGGTCAAGTGTGTGGAGGGTGTGCATGTTACATATTCAACAGTTGTTTGGAATATA 
GACTGTGTTACTGATGCCGATGGCACAGAGTTACACCCCACTTCTACAGGTAGTGGATTG 
ACTTACTGTATAAGTGGTGATAATATAGCATGGCCTTTAAAGGTTAACTTGACTAGGAAT 


GGGCATAATAAGGTTGATGTTGCCTTGCAAAATAATGAGCTTATGCCTCACGGTGTAAAG 





ACAAAGGCTTGCGTAGCAGGTGTAGATCAAGCACATTGTAGCGTTGAGTCTAAATGTTAT 


TATACAAGTATTAGTGGCAGTTCAGTTGTAGCTGCTATTACCTCTTCAAATCCTAATCTG 


AAAGTAGCCTCTTTTTTGAATGAGGCAGGTAATCAGATTTATGTAGACTTAGACCGAGCA 


TGTAAATTTGGTATGAAAGTGGGTGATAAGGTTGAAGTTGTTTACCTGTATTTTATAAAA 


AATACGAGGTCTATTGTAAGAGGTATGGTACTTGGTGCTATATCTAATGTTGTTGTGTTA 


CAATCTAAAGGTCATGAGACAGAGGAAGTGGATGCTGTAGGCATTCTCTCACTTTGTTCT 


TTTGCAGTAGATCCTGCGGATACATATTGTAAATATGTGGCAGCAGGTAATCAACCTTTA 


GGTAACTGTGTTAAAATGTTGACAGTACATAATGGTAGTGGTTTTGCAATAACATCAAAG 


CCAAGTCCAACTCCGGATCAGGATTCTTATGGAGGAGCTTCTGTGTGTCTTTATTGTAGA 


GCACATATAGCACACCCTGGCGGAGCAGGAAATTTAGATGGACGCTGTCAATTTAAAGGT 


TCTTTTGTGCAAATACCTACTACGGAGAAAGATCCTGTTGGATTCTGTCTACGTAACAAG 


GTTTGCACTGTTTGTCAGTGTTGGATTGGTTATGGATGTCAGTGTGATTCACTTAGACAA 


CCTAAACCTTCTGTTCAGTCAGTTGCTGTTGCATCTGGTTTTGATAAGAATTATTTAAAC 


GGGTACGGGGTAGCAGTGAGGCTCGGCTGATACCCCTAGCTAATGGATGTGACCCCGATG 


TTGTAAAGCGAGCCTTTGATGTTTGTAATAAGGAATCAGCCGGTATGTTTCAAAATTTGA 


AGCGTAACTGTGCACGATTCCAAGAAGTACGTGATACTGAAGATGGAAATCTTGAGTATT 


GTGATTCTTATTTTGTGGTTAAACAAACCACTCCTAGTAATTATGAACATGAGAAAGCTT 


GTTATGAAGACTTAAAGTCAGAAGTAACAGCTGATCATGATTTCTTTGTGTTCAATAAGA 


ACATTTATAATATTAGTAGGCAGAGGCTTACTAAGTATACTATGATGGATTTTTGCTATG 


CTTTGCGGCACTTTGACCCAAAGGATTGCGAAGTTCTTAAAGAAATACTTGTCACTTATG 


GTTGTATAGAAGATTATCACCCTAAGTGGTTTGAAGAGAATAAGGATTGGTACGACCCAA 


TAGAAAACCCTAAATATTATGCCATGTTGGCTAAAATGGGACCTATTGTACGAGGTGCTT 


TATTGAATGCTATTGAGTTCGGAAACCTCATGGTTGAAAAAGGTTATGTTGGTGTTATTA 


CACTTGATAACCAAGATCTTAATGGCAAATTTTATGATTTTGGTGATTTTCAGAAGACAG 


CGCCTGGTGCTGGTGTTCCTGTTTTTGATACGTATTATTCTTACATGATGCCCATCATAG 


CCATGACTGATGCGTTGGCACCTGAGAGGTATTTTGAATATGATGTGCATAAGGGTTATA 


AATCTTATGATCTCCTCAAGTATGATTATACTGAGGAGAAACAAGATTTGTTTCAGAAGT 


ACTTTAAGTATTGGGATCAAGAGTATCACCCTAACTGTCGCGACTGTAGTGATGACAGGT 





GTTTGATACATTGTGCAAACTTCAACATCTTGTTTTCTACACTTGTACCGCAGACTTCTT 


TCGGTAATTTGTGTAGAAAGGTTTTTGTTGATGGTGTACCATTTATAGCTACTTGTGGCT 


ATCATTCTAAGGAACTTGGTGTTATTATGAATCAAGATAACACCATGTCATTTTCAAAAA 


TGGGTTTGAGTGAACTCATGGAGTTTGTTGGAGATCGTGGCTTGTTAGTGGGGACATGCA 


ATAAATTAGTGGATCTTAGAACGTCTTGTTTTAGTGTTTGTGCTTTAGCGTCTGGTATTA 








CTCATCAAACGGTAAAACCAGGTCACTTTAACAAGGATTTCTACGATTTTGCAGAGAAGG 
CTGGTATGTTTAAGGAAGGTTCTTCTATACCACTTAAACATTTCTTCTACCCACAGACTG
GTAATGCTGCTATAAACGATTATGATTATTATCGTTATAACAGGCCTACCATGTTTGATA 
TACGTCAACTTTTATTTTGTTTAGAAGTGACTTCTAAATATTTTGAATGTTATGAAGGCG 
GCTGTATACCAGCAAGCCAAGTTGTAGTTAACAATTTAGATAAGAGTGCAGGTTATCCGT 


TCAATAAGTTTGGAAAGGCCCGTCTCTATTATGAAATGAGTCTAGAGGAGCAGGACCAAC 





TCTTTGAGAGTACAAAGAAGAACGTCCTGCCTACTATAACTCAGATGAATTTAAAATATG 


CCATATCCGCGAAAAATAGAGCGCGTACAGTGGCAGGTGTGTCTATCCTTTCTACTATGA 





CTAATAGGCAGTTTCATCAGAAGATTCTTAAGTCTATAGTCAACACTAGAAACGCTCCTG 


TAGTTATTGGAACAACCAAGTTTTATGGCGGTTGGGATAACATGTTGAGAAACCTTATTC 


AGGGTGTTGAAGACCCGATTCTTATGGGTTGGGATTATCCAAAGTGTGATAGAGCAATGC 


CTAATTTGTTGCGTATAGCAGCATCTTTAGTACTCGCTCGTAAACACACTAATTGTTGTA 


CTTGGTCTGAACGCGTTTATAGGTTGTATAATGAATGCGCTCAGGTTTTATCTGAAACTG 


TCTTAGCTACAGGTGGTATATATGTGAAACCTGGTGGTACTAGCAGTGGAGATGCTACTA 


CTGCTTATGCAAACAGTGTTTTCAACATAATACAAGCCACATCTGCTAATGTTGCGCGTC 


TTTTGAGTGTTATAACGCGTGATATTGTATATGATGACATTAAGAGCTTGCAGTATGAAT 


TGTACCAGCAGGTTTATAGGCGAGTCAATTTTGACCCAGCATTTGTTGAAAAGTTTTATT 


CTTATTTGTGTAAGAATTTCTCATTGATGATCTTGTCTGACGACGGTGTTGTTTGTTATA 


ACAACACATTAGCCAAACAAGGTCTTGTAGCAGATATTTCTGGTTTTAGAGAAGTTCTCT 


ACTATCAGAACAATGTTTTTATGGCTGATTCTAAATGTTGGGTTGAACCAGATTTAGAAA 


AAGGCCCACATGAATTTTGTTCACAGCACACAATGTTAGTGGAGGTTGATGGTGAGCCTA 


GATACTTGCCATATCCAGACCCATCACGTATTTTGTGTGCATGTGTTTTTGTAGATGATT 


TGGATAAGACAGAATCTGTGGCTGTTATGGAGCGTTATATCGCTCTTGCCATAGATGCGT 


ACCCACTAGTACATCATGAAAATGAGGAGTACAAGAAGGTATTCTTTGTGCTTCTTTCAT 


ACATCAGAAAACTCTATCAAGAGCTTTCTCAGAATATGCTTATGGACTACTCTTTTGTAA 


TGGATATAGATAAGGGTAGTAAATTTTGGGAACAGGAGTTCTATGAAAATATGTATAGAG 


CCCCTACAACATTACAGTGTTGTGGCGTTTGTGTAGTGTGTAATAGTCAAACTATATTGC 


GCTGTGGTAATTGTATTCGCAAACCATTTTTGTGTTGTAAGTGTTGCTATGACCATGTCA 


TGCACACAGACCACAAAAATGTTTTGTCTATAAATCCTTACATTTGCTCACAGCCAGGTT 


GTGGTGAAGCAGATGTTACTAAATTGTACCTCGGAGGTATGTCATACTTCTGCGGTAATC 


ATAAACCAAAGTTATCAATACCGTTAGTATCTAATGGTACAGTGTTTGGAATTTACAGGG 


CTAATTGTGCAGGTAGCGAAAATGTTGATGATTTTAATCAACTAGCTACTACTAATTGGT 


CTACTGTGGAACCTTATATTTTGGCAAATCGTTGTGTAGATTCGTTGAGACGCTTTGCTG 


CAGAGACAGTAAAAGCTACAGAAGAATTACATAAGCAACAATTTGCTAGTGCAGAAGTGA 


GAGAAGTACTCTCAGATCGTGAATTGATTCTGTCTTGGGAGCCAGGTAAAACCAGGCCTC 


CATTGAATAGAAATTATGTTTTCACTGGCTTTCACTTTACTAGAACTAGTAAAGTTCAGC 


TCGGTGATTTTACATTTGAAAAAGGTGAAGGTAAGGACGTTGTCTATTATCGAGCGACGT 


CTACTGCTAAATTGTCTGTTGGAGACATTTTTGTTTTAACCTCACACAATGTTGTTTCTC 


TTATAGCGCCAACGTTGTGTCCTCAGCAAACCTTTTCTAGGTTTGTGAATTTAAGACCTA 


ATGTGATGGTACCTGCGTGTTTTGTAAATAACATTCCATTGTACCATTTAGTAGGCAAGC 


AGAAGCGTACTACAGTACAAGGCCCTCCTGGCAGTGGTAAATCCCATTTTGCTATAGGAT 
TGGCGGCTTACTTTAGTAACGCCCGTGTCGTTTTTACTGCATGCTCTCATGCAGCTGTTG


ATGCTTTATGTGAAAAAGCTTTTAAGTTTCTTAAAGTAGATGATTGCACTCGTATAGTAC 
CTCAAAGGACTACTATCGATTGCTTCTCTAAGTTTAAAGGTAATGACACAGGCAAAAAGT 
ACATTTTTAGTACTATTAATGCCTTGCCAGAAGTTAGTTGTGACATTCTTTTGGTTGACG 
AGGTTAGTATGTTGACCAATTACGAATTGTCTTTTATTAATGGTAAGATAAACTATCAAT 
ATGTTGTGTATGTAGGTGATCCTGCTCAATTACCGGCGCCTCGTACGTTGCTTAACGGTT 
CACTCTCTCCAAAGGATTATAATGTTGTCACAAACCTTATGGTTTGTGTTAAACCTGACA 
TTTTCCTTGCAAAGTGTTACCGTTGTCCTAAAGAAATTGTAGATACTGTTTCTACTCTTG 
TATATGATGGAAAGTTTATTGCAAATAACCCGGAATCACGTCAGTGTTTCAAGGTTATAG 
TTAATAATGGTAATTCTGATGTAGGACATGAAAGTGGCTCAGCCTACAACATAACTCAAT 
TAGAATTTGTGAAAGATTTTGTCTGTCGCAATAAGGAATGGCGGGAAGCAACATTCATTT 
CACCTTATAATGCTATGAACCAGAGAGCCTACCGTATGCTTGGACTTAATGTTCAGACAG 
TAGACTCGTCTCAAGGTTCGGAGTATGATTATGTTATCTTTTGTGTTACTGCAGATTCGC 
AGCATGCACTGAATATTAACAGATTCAATGTAGCGCTTACAAGAGCCAAGCGTGGTATAC 
TAGTTGTCATGCGTCAGCGTGATGAACTATATTCAGCTCTTAAGTTTATAGAGCTTGATA 
GTGTAGCAAGTCTGCAAGGTACAGGCTTGTTTAAAATTTGCAACAAAGAGTTTAGTGGTG 
TTCACCCAGCTTATGCAGTCACAACTAAGGCTCTTGCTGCAACTTATAAAGTTAATGATG 
AACTTGCTGCACTTGTTAACGTGGAAGCTGGTTCAGAAATAACATATAAACATCTTATTT 
CTTTGTTAGGGTTTAAGATGAGTGTTAATGTTGAAGGCTGCCACAACATGTTTATAACAC 
GTGATGAGGCTATCCGCAACGTAAGAGGTTGGGTAGGTTTTGATGTAGAAGCAACACATG 
CTTGCGGTACTAACATTGGTACTAACCTGCCTTTCCAAGTAGGTTTCTCTACTGGTGCAG 
ACTTTGTAGTTACGCCTGAGGGACTTGTAGATACTTCAATAGGCAATAATTTTGAGCCTG 
TGAATTCTAAAGCACCTCCAGGTGAACAATTTAATCACTTGAGAGCGTTATTCAAAAGTG 
CTAAACCTTGGCATGTTGTAAGGCCAAGGATTGTGCAAATGTTAGCGGATAACCTGTGCA 
ACGTTTCAGATTGTGTAGTGTTTGTCACGTGGTGTCATGGCCTAGAACTAACCACTTTGC 
GCTATTTTGTTAAAATAGGCAAGGACCAAGTTTGTTCTTGCGGTTCTAGAGCAACAACTT 
TTAATTCTCATACTCAGGCTTATGCTTGTTGGAAGCATTGCTTGGGTTTTGATTTTGTTT 
ATAATCCACTCTTAGTGGATATTCAACAGTGGGGTTATTCTGGTAACCTACAATTTAACC 
ATGATTTGCATTGTAATGTGCATGGACACGCACATGTAGCTTCTGCGGATGCTATTATGA 
CGCGTTGTCTTGCAATTAATAATGCATTTTGTCAAGATGTCAACTGGGATTTAACTTACC 
CTCATATAGCAAATGAGGATGAAGTCAATTCTAGCTGTAGATATTTACAACGCATGTATC 
TTAATGCATGTGTTGATGCTCTTAAAGTTAACGTTGTCTATGATATAGGCAACCCTAAAG 
GTATAAAATGTGTTAGACGTGGAGACTTAAATTTTAGATTCTATGATAAGAATCCAATAG 
TACCCAATGTCAAGCAGTTTGAGTATGACTATAATCAGCACAAAGATAAGTTTGCTGATG 
GTCTTTGTATGTTTTGGAATTGTAATGTGGATTGTTATCCCGACAATTCCTTAGTTTGTA 
GGTACGACACACGAAATTTGAGTGTGTTTAACCTACCTGGTTGTAATGGTGGTAGCTTGT 
ATGTTAACAAGCATGCATTCCACACACCTAAATTTGATCGCACTAGCTTTCGTAATTTGA 
AAGCTATGCCATTCTTTTTCTATGACTCATCGCCTTGCGAGACCATTCAATTGGATGGAG 
TTGCGCAAGACCTTGTGTCATTAGCTACGAAAGATTGTATCACAAAATGCAACATAGGCG 


GTGCTGTTTGTAAAAAGCACGCACAAATGTATGCAGATTTTGTGACTTCTTATAATGCAG 











CTGTTACTGCTGGTTTTACTTTTTGGGTTACTAATAATTTTAACCCATATAATTTGTGGA 
AAAGTTTTTCAGCTCTCCAGTCTATCGACAATATTGCTTATAATATGTATAAGGGTGGTC
ATTATGATGCTATTGCAGGAGAAATGCCCACTATCGTAACTGGAGATAAAGTTTTTGTTA 
TAGATCAAGGCGTAGAAAAAGCAGTTTTTTTTAATCAAACAATTCTGCCTAGATCTGTAG 
CGTTTGAGCTGTATGCGAAGAGAAATATTCGCACACTGCCAAACAACCGTATTTTGAAAG 
GTTTGGGTGTAGATGTGACTAATGGATTTGTAATTTGGGATTACACGAACCAAACACCAC 


TATACCGTAATACTGTTAAGGTATGTGCATATACAGACATAGAACCAAATGGCCTAATAG 





TGCTGTATGATGATAGATATGGTGATTACCAGTCTTTTCTAGCTGCTGATAATGCTGTTT 


TAGTTTCTACACAGTGTTACAAGCGGTATTCGTATGTAGAAATACCGTCAAACCTGCTTG 


TTCAGAACGGTATTCCGTTAAAAGATGGAGCGAACCTGTATGTTTATAAGCGTGTTAATG 


GTGCGTTTGTTACGCTACCTAACACATTAAACACACAGGGTCGCAGTTATGAAACTTTTG 


AACCTCGTAGTGATGTTGAGCGTGATTTTCTCGACATGTCTGAGGAGAGTTTTGTAGAAA 


AGTATGGTAAAGAATTAGGTCTACAGCACATACTGTATGGTGAAGTTGATAAGCCCCAAT 


TAGGTGGTTTACACACTGTTATAGGTATGTGCAGACTTTTACGTGCGAATAAGTTGAACG 


CAAAGTCTGTTACTAATTCTGATTCTGATGTCATGCAAAATTATTTTGTATTGGCAGACA 


ATGGTTCCTACAAGCAAGTGTGTACTGTTGTGGATTTGCTGCTTGATGATTTCTTAGAAC 


TTCTTAGGAACATACTGAAAGAGTATGGTACTAATAAGTCTAAAGTTGTAACAGTGTCAA 


TTGATTACCATAGCATAAATTTTATGACTTGGTTTGAAGATGGCATTATTAAAACATGTT 


ATCCACAGCTTCAATCAGCATGGACGTGTGGTTATAATATGCCTGAACTTTATAAAGTTC 


AGAATTGTGTTATGGAACCTTGCAACATTCCTAATTATGGTGTTGGAATAGCGTTGCCAA 


GTGGTATTATGATGAATGTGGCAAAGTATACACAACTCTGTCAATACCTTTCGAAAACAA 


CAATGTGTGTACCGCATAATATGCGAGTAATGCATTTTGGAGCTGGAAGTGACAAAGGAG 


TGGCTCCAGGTAGTACTGTTCTTAAACAATGGCTCCCAGAAGGGACACTCCTTGTCGATA 


ATGATATTGTAGACTATGTGTCTGATGCACATGTTTCTGTGCTTTCAGATTGCAATAAAT 


ATAAGACAGAGCACAAGTTTGATCTTGTGATATCTGATATGTATACAGACAATGATTCAA 


AAAGAAAGCATGAAGGCGTGATAGCCAATAATGGCAATGATGACGTTTTCATATATCTCT 


CAAGTTTTCTTCGTAATAATTTGGCTCTAGGTGGTAGTTTTGCTGTAAAAGTGACAGAGA 


CAAGTTGGCACGAAGTTTTATATGACATTGCACAGGATTGTGCATGGTGGACAATGTTTT 


GTACAGCAGTGAATGCCTCTTCTTCAGAAGCATTCTTGGTTGGTGTTAATTATTTGGGTG 


CAAGTGAAAAGGTTAAGGTTAGTGGAAAAACGCTGCACGCAAATTATATATTTTGGAGGA 


ATTGTAATTATTTACAAACCTCTGCTTATAGTATATTTGACGTTGCTAAGTTTGATTTGA 


GATTGAAAGCAACACCAGTTGTTAATTTGAAAACTGAACAAAAGAGAGACTTAGTGTTTA 


ATTTAATTAAGTGTGGTAAGTTACTGGTAAGAGATGTTGGTAACACCTCTTTTACTAGTG 


TACCAAAGTGCCTTTAGACCACCTAATGGTTGGCATTTACACGGGGGTGCTTATGCGGTA 


GTTAATATTTCTAGCGAATCTAATAATGCAGGCTCTTCACCTGGGTGTATTGTTGGTACT 


ATTCATGGTGGTCGTGTTGTTAATGCTTCTTCTATAGCTATGACGGCACCGTCATCAGGT 


ATGGCTTGGTCTAGCAGTCAGTTTTGTACTGCACACTGTAACTTTTCAGATACTACAGTG 


TTTGTTACACATTGTTATAAATATGATGGGTGTCCTATAACTGGCATGCTTCAAAAGAAT 





TTTTTACGTGTTTCTGCTATGAAAAATGGCCAGCTTTTCTATAATTTAACAGTTAGTGTA 


GCTAAGTACCCTACTTTTAAATCATTTCAGTGTGTTAATAATTTAACATCCGTATATTTA 








AATGGTGATCTTGTTTACACCTCTAATGAGACCACAGATGTTACATCTGCAGGTGTTTAT 
TTTAAAGCTGGTGGACCTATAACTTATAAAGTTATGAGAGAAGTTAAAGCCCTGGCTTAT


TTTGTTAATGGTACTGCACAAGATGTTATTTTGTGTGATGGATCACCTAGAGGCTTGTTA 
GCATGCCAGTATAATACTGGCAATTTTTCAGATGGCTTTTATCCTTTTATTAATAGTAGT 
TTAGTTAAGCAGAAGTTTATTGTCTATCGTGAAAATAGTGTTAATACTACTTTTACGTTA 
CACAATTTCACTTTTCATAATGAGACTGGCGCCAACCCTAATCCTAGTGGTGTTCAGAAT 


ATTCAAACTTACCAAACACAAACAGCTCAGAGTGGTTATTATAATTTTAATTTTTCCTTT 











CTGAGTAGTTTTGTTTATAAGGAGTCTAATTTTATGTATGGATCTTATCACCCAAGTTGT 


AATTTTAGACTAGAAACTATTAATAATGGCTTGTGGTTTAATTCACTTTCAGTTTCAATT 


GCTTACGGTCCTCTTCAAGGTGGTTGCAAGCAATCTGTCTTTAGTGGTAGAGCAACTTGT 


TGTTATGCTTATTCATATGGAGGTCCTTCGCTGTGTAAAGGTGTTTATTCAGGTGAGTTA 


GATCTTAATTTTGAATGTGGACTGTTAGTTTATGTTACTAAGAGCGGTGGCTCTCGTATA 


CAAACAGCCACTGAACCGCCAGTTATAACTCGACACAATTATAATAATATTACTTTAAAT 


ACTTGTGTTGATTATAATATATATGGCAGAACTGGCCAAGGTTTTATTACTAATGTAACC 


GACTCAGCTGTTAGTTATAATTATCTAGCAGACGCAGGTTTGGCTATTTTAGATACATCT 


GGTTCCATAGACATCTTTGTTGTACAAGGTGAATATGGTCTTACTTATTATTAGGTTAAC 


CCTTGCGAAGATGTCAACCAGCAGTTTGTAGTTTCTGGTGGTAAATTAGTAGGTATTCTT 


ACTTCACGTAATGAGACTGGTTCTCAGCTTCTTGAGAACCAGTTTTACATTAAAATCACT 


AATGGAACACGTCGTTTTAGACGTTCTATTACTGAAAATGTTGGAAATTGCCCTTATGTT 


AGTTATGGTAAGTTTTGTATAAAACCTGATGGTTCAATTGCCACAATAGTACCAAAACAA 


TTGGAACAGTTTGTGGCACCTTTACTTAATGTTACTGAAAATGTGCTCATACCTAACAGT 


TTTAATTTAACTGTTACAGATGAGTACATACAAACGCGTATGGATAAGGTCCAAATTAAT 


TGTCTGCAGTATGTTTGTGGCAATTCTCTGGATTGTAGAGATTTGTTTCAACAATATGGG 


CCTGTTTGTGACAACATATTGTCTGTAGTAAATAGTATTGGTCAAAAAGAAGATATGGAA 


CTTTTGAATTTCTATTCTTCTACTAAACCGGCTGGTTTTAATACACCATTTCTTAGTAAT 


GTTAGCACTGGTGAGTTTAATATTTCTCTTCTGTTAACAACTCCTAGTAGTCCTAGAAGG 


CGTTCTTTTATTGAAGACCTTCTATTTACAAGCGTTGAATCTGTTGGATTACCAACAGAT 


GACGCATACAAAAATTGCACTGCAGGACCTTTAGGTTTTCTTAAGGACCTTGCGTGTGCT 


CGTGAATATAATGGTTTGCTTGTGTTGCCTCCCATTATAACAGCAGAAATGCAAATTTTG 


TATACTAGTTCTCTAGTAGCTTCTATGGCTTTTGGTGGTATTACTGCAGCTGGTGCTATA 


CCTTTTGCCACACAACTGCAGGCTAGAATTAATCACTTGGGTATTACCCAGTCACTTTTG 


TTGAAGAATCAAGAAAAAATTGCTGCTTCCTTTAATAAGGCCATTGGTCGTATGCAGGAA 


GGTTTTAGAAGTACATCTCTAGCATTACAACAAATTCAAGATGTTGTTAATAAGCAGAGT 


GCTATTCTTACTGAGACTATGGCATCACTTAATAAAAATTTTGGTGCTATTTCTTCTATG 


ATTCAAGAAATCTACCAGCAACTTGACGCCATACAAGCAAATGCTCAAGTGGATCGTCTT 


ATAACTGGTAGATTGTCATCACTTTCTGTTTTAGCATCTGCTAAGCAGGCGGAGCATATT 


AGAGTGTCACAACAGCGTGAGTTAGCTACTCAGAAAATTAATGAGTGTGTTAAGTCACAG 


TCTATTAGGTACTCCTTTTGTGGTAATGGACGACATGTTCTAACCATACCGCAAAATGCA 


CCTAATGGTATAGTGTTTATACACTTTTCTTATACTCCAGATAGTTTTGTTAATGTTACT 





GCAATAGTGGGTTTTTGTGTAAAGCCAGCTAATGCTAGTCAGTATGCAATAGTACCCGCT 





AATGGTAGGGGTATTTTTATACAAGTTAATGGTAGTTACTACATCACAGCACGAGATATG 








TATATGCCAAGAGCTATTACTGCAGGAGATATAGTTACGCTTACTTCTTGTCAAGCAAAT 
TATGTAAGTGTAAATAAGACCGTCATTACTACATTCGTAGACAATGATGATTTTGATTTT
AATGACGAATTGTCAAAATGGTGGAATGACACTAAGCATGAGCTACCAGACTTTGACAAA 
TTCAATTACACAGTACCTATACTTGACATTGATAGTGAAATTGATCGTATTCAAGGCGTT 
ATACAGGGTCTTAATGACTCTTTAATAGACCTTGAAAAACTTTCAATACTCAAAACTTAT 
ATTAAGTGGCCTTGGTATGTGTGGTTAGCCATAGCTTTTGCCACTATTATCTTCATCTTA 
ATACTAGGATGGGTTTTCTTCATGACTGGATGTTGTGGTTGTTGTTGTGGATGCTTTGGC 
ATTATGCCTCTAATGAGTAAGTGTGGTAAGAAATCTTCTTATTACACGACTTTTGATAAC 
GATGTGGTAACTTAACAATACAGACCTAAAAAGTCTGTTTAATGATTCAAAGTCCCACGT 
CCTTCCTAATAGTATTAATTTTTCTTTGGTGTAAACTTGTACTAAGTTGTTTTAGAGAGT 
TTATTATAGCGCTCCAACAACTAATACAAGTTTTACTCCAAATTATCAATAGTAACTTAC 
AGCCTAGACTGACCCTTTGTCACAGTCTAGACTAATGTTAAACTTAGAAGCAATTATTGA 
AACTGGTGAGCAAGTGATTCAAAAAATCAGTTTCAATTTACAGCATATTTCAAGTGTATT 
AAACACAGAAGTATTTGACCCCTTTGACTATTGTTATTACAGAGGAGGTAATTTTTGGGA 
AATAGAGTCAGCTGAAGATTGTTCAGGTGATGATGAATTTATTGAATAAGTCGCTAGAGG 
AAAATGGAAGTTTTCTAACAGCGCTTTATATATTTGTAGGATTTTTAGCACTTTATCTTC 
TAGGTAGAGCACTTCAAGCATTTGTACAGGCTGCTGATGCTTGTTGTTTATTTTGGTATA 
CATGGGTAGTAATTCCAGGAGCTAAGGGTACAGCCTTTGTATATAAGTATACATATGGTA 
GAAAACTTAACAATCGGGAATTAGAAGCAGTTATTGTCAACGAGTTTCCTAAGAACGGTT 
GGAATAATAAAAATCCAGCAAATTTTCAAGATGTCCAACGAGACAAATTGTACTCTTGAC 
TTTGAACAGTCAGTTGAGCTTTTTAAAGAGTATAATTTATTTATAACTGCATTCTTGTTG 
TTCTTAACCATAATACTTCAGTATGGCTATGCAACAAGAAGTAAGTTTATTTATATACTG 
AAAATGATAGTGTTATGGTGCTTTTGGCCCCTTAACATTGCAGTAGGTGTAATTTCATGT 
ATATACCCACCAAACACAGGAGGTCTTGTCGCAGCGATAATACTTACAGTGTTTGCGTGT 
CTGTCTTTTGTAGGTTATTGGATCCAGAGTATTAGACTCTTTAAGCGGTGTAGGTCATGG 
TGGTCATTTAACCCAGAATCTAATGCCGTAGGTTCAATACTCCTAACTAATGGTCAACAA 
TGTAATTTTGCTATAGAGAGTGTGCCAATGGTGCTTTCTCCAATTATAAAGAATGGTGTT 
CTTTATTGTGAGGGTCAGTGGCTTGCTAAGTGTGAACCAGACCACTTGCCTAAAGATATA 
TTTGTTTGTACACCGGATAGACGTAATATCTACCGTATGGTGCAGAAATATACTGGTGAC 
CAAAGCGGAAATAAGAAACGGTTTGCTACGTTTGTCTATGCAAAGCAGTCAGTAGATACT 
GGCGAGCTAGAAAGTGTAGCAACAGGAGGGAGTAGTCTTTACACCTAAATGTGTGTGTGT 
AGAGAGTATTTAAAATTATTCTTTAATAGTGCCTCTATTTTAAGAGCGCATAATAGTATT 
ATTTTTGAGGATATTAATATAAATCCTCTCTGTTTTATACTCTCTTTTCAAGAGCTATTA 
TTTAAAAAACAGTTTTTCCACTCTTTTGTGCCAAAAACTATTGTTGTTAATGGTGTAACC 
TTTCAAGTAGATAATGGAAAAGTCTACTACGAAGGAAAACCAATTTTTCAGAAAGGTTGT 
TGTAGGTTGTGGTTGAGTTATAAAAAAGATTAAACTACCTACTACACTTATTTTTATAAG 
AGGCGTTTTATCTTACAAGCGCTTAATAAATACGGACGATGAAATGGCTGACTAGTTTTG 
TAAGGGCAGTTATTTCATGTTATAAACCCCTATTATTAACTCAATTAAGAGTATTAGATA 
GGTTAATCTTAGATCATGGACCAAAACACATCTTAACGTGTGTTAGGTGCGTGATTTTGT 


TTCAATTAGATTTAGTTTATAGGTTGGCGTATACGCCTACTCAATCGCTGGTATGAATAA 











TAGTAAAGATAATCCTTTTTGCGGAGCAATAGCAAGAAAAGCGCGAATTTATCTGAGAGA 
AGGATTAGATTGTGTTTACTTTCTTAACAAAGCAGGACAAGCAGAGTCTTGTCCCGCGTG 


TACCTCTCTAGTATTCCAGGGGAAAACTTGTGAGGAACACAAATATAATAATAATCTTTT 
GTCATGGCAAGCGGTAAGGCAACTGGAAAGACAGATGCCCCAGCTCCAGTCATCAAACTA 
GGAGGACCAAAGCCACCTAAAGTTGGTTCTTCTGGAAATGTATCTTGGTTTCAAGCAATA 
AAAGCCAAGAAGTTAAATTCACCTCCGCCTAAGTTTGAAGGTAGCGGTGTTCCTGATAAT 
GAAAATCTAAAACCAAGTCAGCAGCATGGATATTGGAGACGCCAAGCTAGGTTTAAGCCA 
GGTAAAGGTGGAAGAAAACCAGTCCCAGATGCTTGGTATTTTTAGTATACTGGAACAGGA 
CCAGCCGCTAACCTGAATTGGGGTGATAGCCAAGATGGTATAGTGTGGGTTGCTGGTAAG 
GGTGCTGATACTAAATTTAGATCTAATCAGGGTACTCGTGACTCTGACAAGTTTGACCAA 
TATCCGCTACGGTTTTCAGACGGAGGACCTGATGGTAATTTCCGTTGGGATTTCATTCCT 
CTGAATCGTGGCAGGAGTGGGAGATCAACAGCAGCTTCATCAGCAGCATCTAGTAGAGCA 
CCATCACGTGAAGTTTCGCGTGGTCGCAGGAGTGGTTCTGAAGATGATCTTATTGCTCGT 
GCAGCAAGGATAATTCAGGATCAGCAGAAGAAGGGTTCTCGCATTACAAAGGCTAAGGCT 
GATGAAATGGCTCACCGCCGGTATTGCAAGCGCAGTATTCCACCTAATTATAAGGTTGAT 
CAAGTGTTTGGTCCCCGTACTAAAGGTAAGGAGGGAAATTTTGGTGATGACAAGATGAAT 
GAGGAAGGTATTAAGGATGGGCGCGTTACAGCAATGCTCAACCTAGTTCCTAGCAGCCAT 
GCTTGTCTTTTCGGAAGTAGAGTGACGCCCAGACTTCAACCAGATGGGCTGCACTTGAAA 
TTTGAATTTACTACTGTGGTCCCACGTGATGATCCGCAGTTTGATAATTATGTAAAAATT 
TGTGATCAGTGTGTTGATGGTGTAGGAACACGTCCAAAAGATGATGAACCAAGACCAAAG 
TCACGCTCAAGTTCAAGACCTGCAACAAGAGGAAATTCTCCAGCGCCAAGACAGCAGCGC 
CCTAAGAAGGAGAAAAAGCCAAAGAAGCAGGATGATGAAGTGGATAAAGCATTGACCTCA 
GATGAGGAGAGGAACAATGCACAGCTGGAATTTGATGATGAACCCAAGGTAATTAACTGG 
GGGGATTCAGCGCTAGGAGAGAATGAACTTTGAGTAAAATTGAATAGTAAGAGTTAAGGA 
AGATAGGCATGTAGCTTGATTACCTACATGTCTATCGCCAGGGAAATGTCTAATTTGTCT 
ACTTAGTAGCCTGGAAACGAACGGTAGACCCTTAGATTTTAATTTAGTTTAATTTTTAGT 
TTAGTTTAAGTTAGTTTAGAGTAGGTATAAAGATGCCAGTGGCGGGGCCACGCGGAGTAC 
GACCGAGGGTACAGCACTAGGACGCCCATTAGGGGAAGAGCTAAATTTTAGTTTAAGTTA 
AGTTTAATTGGCTATGTATAGTTAAAATTTATAGGCTAGTATAGAGTTAGAGCAAAAAAA 


AAAAAAAAAAAAAAAAAAAA


Replicase 

In addition to the structural and accessory genes, two-thirds
of a coronavirus genome comprises the replicase gene
(at the 5' end of the genome), which is expressed as two
polyproteins, pp1a and pp1ab, in which pp1ab is an extension
product of pp1a as a result of a -1 ribosomal frameshift
mechanism. The two polyproteins are cleaved by two types
of virus-encoded proteinases, usually resulting in 16
non-structural proteins (Nsp1-16); IBV lacks Nsp1, thereby
encoding Nsp2-16.

Thus Gene 1 in IBV encodes 15 (16 in other coronaviruses)
non-structural proteins (nsp2-16), which are associated
with RNA replication and transcription.
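
The relationship between pp1a and pp1ab described above can be illustrated with a minimal sketch. It assumes a short, made-up toy sequence (not taken from SEQ ID NO: 1) and a hypothetical helper that is told where the ribosome slips; the function names and the slip position are illustrative assumptions only. Reading straight through in the original frame gives the shorter product, while stepping back one nucleotide at the slip site and continuing gives the extended product.

```python
# Standard genetic code, with codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(seq):
    """Translate from the first base, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def translate_with_minus1_frameshift(seq, slip_end):
    """Translate with a -1 shift after the codon that ends at slip_end.

    slip_end is the 0-based index just past the last codon decoded in the
    original frame (a hypothetical parameter for this sketch). It assumes
    there is no stop codon upstream of the slip site.
    """
    upstream = translate(seq[:slip_end])
    # The ribosome backs up one nucleotide, so the last base of the
    # previous codon is read again as the first base of the next codon.
    downstream = translate(seq[slip_end - 1:])
    return upstream + downstream


# Toy sequence (illustrative only): it contains TTTAAAC followed by an
# in-frame stop codon, so the unshifted product terminates early.
TOY = "ATGTTTTTAAACGGGTAGCATGA"

short_product = translate(TOY)                               # pp1a-like
extended_product = translate_with_minus1_frameshift(TOY, 12)  # pp1ab-like
```

In this toy case the extended product shares its N-terminal residues with the short product and then continues in the -1 frame past the stop codon that terminates the short product, which mirrors how pp1ab extends pp1a.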

The term “replicase protein” is used herein to refer to the
pp1a and pp1ab polyproteins or individual nsp subunits.

The term “replicase gene” is used herein to refer to a
nucleic acid sequence which encodes for replicase proteins.
A summary of the functions of coronavirus nsp proteins is
provided in Table 1.


TABLE 1

Nsp Protein   Key features

1             Conserved within but not between coronavirus genetic
              groups; potential regulatory functions in the host cell.
2             Dispensable for MHV and SARS-CoV replication in
              tissue culture
3             Acidic domain; macro domain with ADRP and poly
              (ADP-ribose)-binding activities; one or two ZBD-
              containing papain-like proteases; Y domain
4             Transmembrane domain
5             3C-like main protease, homodimer
6             Transmembrane domain
7             Interacts with nsp8 to form a hexadecamer complex
8             Noncanonical RNA polymerase; interacts with nsp7 to
              form a hexadecameric complex
9             ssRNA-binding protein, dimer
10            RNA-binding protein, homododecamer, zinc-binding
              domain, known to interact with nsp14 and nsp16
11            Unknown
12            RNA-dependent RNA polymerase
13            Zinc-binding domain, NTPase, dNTPase, 5'-to-3' RNA
              and DNA helicase, RNA 5'-triphosphatase
14            3'-to-5' exoribonuclease, zinc-binding domain and N7-
              methyltransferase
15            Uridylate-specific endoribonuclease, homohexamer
16            Putative ribose-2'-O-methyltransferase

The variant replicase gene encoded by the coronavirus of
the present invention comprises a mutation in one or more
of the sections of sequence encoding nsp-10, nsp-14, nsp-15
or nsp-16.

Nsp-10 has RNA-binding activity and appears to be
involved in homo and/or heterotypic interactions within
other nsps from the pp1a/pp1ab region. It adopts an α/β fold
comprised of five α-helices, one 3₁₀-helix and three
β-strands. Two zinc-binding sites have been identified that
are formed by conserved cysteine residues and one histidine
residue (Cys-74/Cys-77/His-83/Cys-90; Cys-117/Cys-120/
Cys-128/Cys-130). The protein has been confirmed to bind
single-stranded and double-stranded RNA and DNA without
obvious specificity. Nsp-10 can be cross-linked with nsp-9,
suggesting the existence of a complex network of protein-
protein interactions involving nsp-7, -8, -9 and -10. In
addition, nsp-10 is known to interact with nsp-14 and
nsp-16.

Nsp-14 comprises a 3'-to-5' exoribonuclease (ExoN)
active domain in the amino-terminal region. SARS-CoV
ExoN has been demonstrated to have metal ion-dependent
3'-to-5' exoribonuclease activity that acts on both single-
stranded and double-stranded RNA, but not on DNA. Nsp-14
has been shown to have proof-reading activity. This nsp
has also been shown to have N7-methyltransferase (MT)
activity in the carboxyl-terminal region.

Nsp-15 associated NendoU (nidoviral endoribonuclease,
specific for U) RNase activity has been reported for a
number of coronaviruses, including SARS-CoV, MHV and
IBV. The activities were consistently reported to be
significantly enhanced by Mn²⁺ ions and there was little activity in
the presence of Mg²⁺ and Ca²⁺. NendoU cleaves at the 3'
side of uridylate residues in both single-stranded and
double-stranded RNA. The biologically relevant substrate(s)
of coronavirus NendoUs remains to be identified.

Nsp-16 has been predicted to mediate ribose-2'-O-
methyltransferase (2'-O-MTase) activity and reverse-genetics
experiments have shown that the 2'-O-MTase domain is
essential for viral RNA synthesis in HCoV-229E and SARS-
CoV. The enzyme may be involved in the production of the
cap 1 structures of coronavirus RNAs and it may also
cooperate with NendoU and ExoN in other RNA processing
pathways. 2'-O-MTase might also methylate specific RNAs
to protect them from NendoU-mediated cleavage.

The genomic and protein sequences for nsp-10, -14, -15 
and -16 are provided as SEQ ID NO: 2-5 and 6-9, respec- 
tively. 
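
The nsp coding regions in this section are identified by nucleotide ranges within SEQ ID NO: 1 (for example, nucleotides 11884-12318 for nsp-10). The following is a minimal sketch, not part of the described method, of how such 1-based inclusive ranges map onto sequence lengths and translated protein. The helper names are hypothetical; the ranges are those quoted in the sequence headings below, and the short nucleotide fragment is the start of SEQ ID NO: 2.

```python
# Standard genetic code, with codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Nucleotide ranges (1-based, inclusive) quoted for each nsp in SEQ ID NO: 1.
NSP_RANGES = {
    "nsp-10": (11884, 12318),
    "nsp-14": (16938, 18500),
    "nsp-15": (18501, 19514),
    "nsp-16": (19515, 20423),
}


def region_length(start, end):
    """Length in nucleotides of a 1-based inclusive range."""
    return end - start + 1


def extract_region(genome, start, end):
    """Slice a 1-based inclusive range out of a genome string."""
    return genome[start - 1:end]


def translate(nucleotides):
    """Translate from the first base, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(nucleotides) - len(nucleotides) % 3, 3):
        residue = CODON_TABLE[nucleotides[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# Length of each region; every value is a whole number of codons.
lengths = {name: region_length(*rng) for name, rng in NSP_RANGES.items()}

# First 13 codons of the nsp-10 nucleotide sequence (SEQ ID NO: 2).
nsp10_start = translate("TCTAAAGGTCATGAGACAGAGGAAGTGGATGCTGTAGGC")
```

The ranges give 435, 1563, 1014 and 909 nucleotides respectively, each a multiple of three, and the translated fragment matches the first residues of the nsp-10 amino acid sequence listed below.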


(nsp-10 nucleotide sequence - nucleotides 11884-12318 of SEQ ID NO: 1)

SEQ ID NO: 2


TCTAAAGGTCATGAGACAGAGGAAGTGGATGCTGTAGGCATTCTCTCACTTTGTTCTTTTGCAGTA 
GATCCTGCGGATACATATTGTAAATATGTGGCAGCAGGTAATCAACCTTTAGGTAACTGTGTTAAA 
ATGTTGACAGTACATAATGGTAGTGGTTTTGCAATAACATCAAAGCCAAGTCCAACTCCGGATCAG 
GATTCTTATGGAGGAGCTTCTGTGTGTCTTTATTGTAGAGCACATATAGCACACCTTGGCGGAGCA 
GGAAATTTAGATGGACGCTGTCAATTTAAAGGTTCTTTTGTGCAAATACCTACTACGGAGAAAGAT 
CCTGTTGGATTCTGTCTACGTAACAAGGTTTGCACTGTTTGTCAGTGTTGGATTGGTTATGGATGT 
CAGTGTGATTCACTTAGACAACCTAAACCTTCTGTTCAG 


(nsp-14 nucleotide sequence - nucleotides 16938-18500 of SEQ ID NO: 1)

SEQ ID NO: 3


GGTACAGGCTTGTTTAAAATTTGCAACAAAGAGTTTAGTGGTGTTCACCCAGCTTATGCAGTCACA 


ACTAAGGCTCTTGCTGCAACTTATAAAGTTAATGATGAACTTGCTGCACTTGTTAACGTGGAAGCT 


GGTTCAGAAATAACATATAAACATCTTATTTCTTTGTTAGGGTTTAAGATGAGTGTTAATGTTGAA 


GGCTGCCACAACATGTTTATAACACGTGATGAGGCTATCCGCAACGTAAGAGGTTGGGTAGGTTTT 


GATGTAGAAGCAACACATGCTTGCGGTACTAACATTGGTACTAACCTGCCTTTCCAAGTAGGTTTC 


TCTACTGGTGCAGACTTTGTAGTTACGCCTGAGGGACTTGTAGATACTTCAATAGGCAATAATTTT 


GAGCCTGTGAATTCTAAAGCACCTCCAGGTGAACAATTTAATCACTTGAGAGCGTTATTCAAAAGT 


GCTAAACCTTGGCATGTTGTAAGGCCAAGGATTGTGCAAATGTTAGCGGATAACCTGTGCAACGTT 


TCAGATTGTGTAGTGTTTGTCACGTGGTGTCATGGCCTAGAACTAACCACTTTGCGCTATTTTGTT 


AAAATAGGCAAGGACCAAGTTTGTTCTTGCGGTTCTAGAGCAACAACTTTTAATTCTCATACTCAG 








GCTTATGCTTGTTGGAAGCATTGCTTGGGTTTTGATTTTGTTTATAATCCACTCTTAGTGGATATT 


CAACAGTGGGGTTATTCTGGTAACCTACAATTTAACCATGATTTGCATTGTAATGTGCATGGACAC 





GCACATGTAGCTTCTGCGGATGCTATTATGACGCGTTGTCTTGCAATTAATAATGCATTTTGTCAA 


US 10,130,701 B2 
GATGTCAACTGGGATTTAACTTACCCTCATATAGCAAATGAGGATGAAGTCAATTCTAGCTGTAGA 
TATTTACAACGCATGTATCTTAATGCATGTGTTGATGCTCTTAAAGTTAACGTTGTCTATGATATA 
GGCAACCCTAAAGGTATTAAATGTGTTAGACGTGGAGACTTAAATTTTAGATTCTATGATAAGAAT 
CCAATAGTACCCAATGTCAAGCAGTTTGAGTATGACTATAATCAGCACAAAGATAAGTTTGCTGAT 
GGTCTTTGTATGTTTTGGAATTGTAATGTGGATTGTTATCCCGACAATTCCTTACTTTGTAGGTAC 
GACACACGAAATTTGAGTGTGTTTAACCTACCTGGTTGTAATGGTGGTAGCTTGTATGTTAACAAG 
CATGCATTCCACACACCTAAATTTGATCGCACTAGCTTTCGTAATTTGAAAGCTATGCCATTCTTT 
TTCTATGACTCATCGCCTTGCGAGACCATTCAATTGGATGGAGTTGCGCAAGACCTTGTGTCATTA 
GCTACGAAAGATTGTATCACAAAATGCAACATAGGCGGTGCTGTTTGTAAAAAGCACGCACAAATG 
TATGCAGATTTTGTGACTTCTTATAATGCAGCTGTTACTGCTGGTTTTACTTTTTGGGTTACTAAT 
AATTTTAACCCATATAATTTGTGGAAAAGTTTTTCAGCTCTCCAG 

(nsp-15 nucleotide sequence - nucleotides 18501-19514 of SEQ ID NO: 1)

SEQ ID NO: 4

TCTATCGACAATATTGCTTATAATATGTATAAGGGTGGTCATTATGATGCTATTGCAGGAGAAATG 
CCCACTATCGTAACTGGAGATAAAGTTTTTGTTATAGATCAAGGCGTAGAAAAAGCAGTTTTTTTT 
AATCAAACAATTCTGCCTACATCTGTAGCGTTTGAGCTGTATGCGAAGAGAAATATTCGCACACTG 
CCAAACAACCGTATTTTGAAAGGTTTGGGTGTAGATGTGACTAATGGATTTGTAATTTGGGATTAC 
ACGAACCAAACACCACTATACCGTAATACTGTTAAGGTATGTGCATATACAGACATAGAACCAAAT 
GGCCTAATAGTGCTGTATGATGATAGATATGGTGATTACCAGTCTTTTCTAGCTGCTGATAATGCT 
GTTTTAGTTTCTACACAGTGTTACAAGCGGTATTCGTATGTAGAAATACCGTCAAACCTGCTTGTT 
CAGAACGGTATTCCGTTAAAAGATGGAGCGAACCTGTATGTTTATAAGCGTGTTAATGGTGCGTTT 
GTTACGCTACCTAACACAATAAACACACAGGGTCGAAGTTATGAAACTTTTGAACCTCGTAGTGAT 
GTTGAGCGTGATTTTCTCGACATGTCTGAGGAGAGTTTTGTAGAAAAGTATGGTAAAGAATTAGGT 
CTACAGCACATACTGTATGGTGAAGTTGATAAGCCCCAATTAGGTGGTTTCCACACTGTTATAGGT 
ATGTGCAGACTTTTACGTGCGAATAAGTTGAACGCAAAGTCTGTTACTAATTCTGATTCTGATGTC 
ATGCAAAATTATTTTGTATTGGCAGACAATGGTTCCTACAAGCAAGTGTGTACTGTTGTGGATTTG 
CTGCTTGATGATTTCTTAGAACTTCTTAGGAACATACTGAAAGAGTATGGTACTAATAAGTCTAAA 
GTTGTAACAGTGTCAATTGATTACCATAGCATAAATTTTATGACTTGGTTTGAAGATGGCATTATT 
AAAACATGTTATCCACAGCTTCAA 

(nsp-16 nucleotide sequence - nucleotides 19515-20423 of SEQ ID NO: 1)

SEQ ID NO: 5

TCAGCATGGACGTGTGGTTATAATATGCCTGAACTTTATAAAGTTCAGAATTGTGTTATGGAACCT 
TGCAACATTCCTAATTATGGTGTTGGAATAGCGTTGCCAAGTGGTATTATGATGAATGTGGCAAAG 
TATACACAACTCTGTCAATACCTTTCGAAAACAACAATGTGTGTACCGCATAATATGCGAGTAATG 
CATTTTGGAGCTGGAAGTGACAAAGGAGTGGTGCCAGGTAGTACTGTTCTTAAACAATGGCTCCCA 
GAAGGGACACTCCTTGTCGATAATGATATTGTAGACTATGTGTCTGATGCACATGTTTCTGTGCTT 
TCAGATTGCAATAAATATAAGACAGAGCACAAGTTTGATCTTGTGATATCTGATATGTATACAGAC 
AATGATTCAAAAAGAAAGCATGAAGGCGTGATAGCCAATAATGGCAATGATGACGTTTTCATATAT 
CTCTCAAGTTTTCTTCGTAATAATTTGGCTCTAGGTGGTAGTTTTGCTGTAAAAGTGACAGAGACA 


AGTTGGCACGAAGTTTTATATGACATTGCACAGGATTGTGCATGGTGGACAATGTTTTGTACAGCA 





GTGAATGCCTCTTCTTCAGAAGCATTCTTGATTGGTGTTAATTATTTGGGTGCAAGTGAAAAGGTT 


AAGGTTAGTGGAAAAACGCTGCACGCAAATTATATATTTTGGAGGAATTGTAATTATTTACAAACC 
TCTGCTTATAGTATATTTGACGTTGCTAAGTTTGATTTGAGATTGAAAGCAACGCCAGTTGTTAAT 


TTGAAAACTGAACAAAAGACAGACTTAGTCTTTAATTTAATTAAGTGTGGTAAGTTACTGGTAAGA 
GATGTTGGTAACACCTCTTTTACTAGTGACTCTTTTGTGTGTACTATGTAG 


(nsp-10 amino acid sequence) 


SEQ ID NO: 6


SKGHETEEVDAVGILSLCSFAVDPADTYCKYVAAGNQPLGNCVKMLTVKNGSGFAITSKPSPTPDQ 
DSYGGASVCLYCRAHIAHPGGAGNLDGRCQFKGSFVQIPTTEKDPVGFCLRNKVCTVCQCWIGYGC 
QCDSLRQPKPSVQ 


(nsp-14 amino acid sequence) 


SEQ ID NO: 7


GTGLFKICNKEFSGVHPAYAVTTKALAATYKVNDELAALVNVEAGSEITYKHLISLLGFKMSVNVE 
GCHNMFITRDEAIRNVRGWVGFDVEATHACGTNIGTNLPFQVGFSTGADFVVTPEGLVDTSIGNNF 
EPVNSKAPPGEQFNHLRALFKSAKPWHVVRPRIVQMLADNLCNVSDCVVFVTWCHGLELTTLRYFV
KIGKDQVCSCGSRATTENSHTQAYACWKHCLGFDFVYNPLLVDIQQWGYSGNLQFNHDLHCNVHGH
AHVASADAIMTRCLAINNAFCQDVNWDLTYPHIANEDEVNSSCRYLQRMYLNACVDALKVNVVYDI
GNPKGIKCVRRGDLNFRFYDKNPIVPNVKQFEYDYNQHKDKFADGLCMFWNCNVDCYPDNSLVCRY
DTRNLSVFNLPGCNGGSLYVNKHAFHTPKFDRTSFRNLKAMPFFFYDSSPCETIQLDGVAQDLVSL
ATKDCITKCNICGAVCKKKAQMYADFVTSYNAAVTAGFTFWVTNNFNPYNLWKSFSALQ


(nsp-15 amino acid sequence) 


SEQ ID NO: 8 


SIDNIAYNMYKGGHYDAIAGEMPTIVTGDKVFVIDQGVEKAVFFNQTILPTSVAFELYAKRNIRTL
PNNRILKGLGVDVTNGFVIWDYTNQTPLYRNTVKVCAYTDIEPNGLIVLYDDRYGDYQSFLAADNA
VLVSTQCYKRYSYVEIPSNLLVQNGIPLKDGANLYVYKRVNGAFVTLPNTLNTQGRSYETFEPRSD
VERDFLDMSEESFVEKYGKELGLQHILYGEVDKPQLGGLHTVIGMCRLLRANKLNAKSVTNSDSDV
MQNYFVLADNGSYKQVCTVVDLLLDDFLELLRNILKEYGTNKSKVVTVSIDYHSINFMTWFEDGII 
KTCYPQLQ 


(nsp-16 amino acid sequence) 


SEQ ID NO: 9 


SAWTCGYNMPELYKVQNCVMEPCNIPNYGVGIALPSGIMMNVAKYTQLCQYLSKTTMCVPHNMRVM
HFGAGSDKGVAPGSTVLKQWLPEGTLLVDNDIVDYVSDAHVSVLSDCNKYKTEHKFDLVISDMYTD
NDSKRKHEGVIANNGNDDVFIYLSSFLRNNLALGGSFAVKVTETSWHEVLYDIAQDCAWWTMFCTA
VNASSSEAFLVGVNYLGASEKVIWSGKTLHANYIFWRNCNYLQTSAYSIFDVAKFDLRLKATPVVN
LKTEQKTDLVFNLIKCGKLLVRDVGNTSFTSDSFVCTM


Reduced Pathogenicity 


The live, attenuated coronavirus of the present invention 
comprises a variant replicase gene which causes the virus to 
have reduced pathogenicity compared to a coronavirus 
expressing the corresponding wild-type gene. 


The term “attenuated” as used herein, refers to a virus that 
exhibits said reduced pathogenicity and may be classified as 
non-virulent. A live, attenuated virus is a weakened repli- 
cating virus still capable of stimulating an immune response 
and producing immunity but not causing the actual illness. 


The term “pathogenicity” is used herein according to its 
normal meaning to refer to the potential of the virus to cause 
disease in a subject. Typically the pathogenicity of a coro- 
navirus is determined by assaying disease associated symp- 
toms, for example sneezing, snicking and reduction in 
tracheal ciliary activity. 


The term “reduced pathogenicity” is used to describe that
the level of pathogenicity of a coronavirus is decreased, 
lessened or diminished compared to a corresponding, wild- 
type coronavirus. 

In one embodiment, the coronavirus of the present inven- 
tion has a reduced pathogenicity compared to the parental 
M41-CK virus from which it was derived or a control 
coronavirus. The control coronavirus may be a coronavirus 
with a known pathogenicity, for example a coronavirus 
expressing the wild-type replicase protein. 

The pathogenicity of a coronavirus may be assessed 
utilising methods well-known in the art. Typically, patho- 
genicity is assessed by assaying clinical symptoms in a 
subject challenged with the virus, for example a chicken. 

As an illustration, the chicken may be challenged at 8-24
days old by nasal or ocular inoculation. Clinical symptoms
associated with IBV infection may be assessed 3-10 days
post-infection. Clinical symptoms commonly assessed to
determine the pathogenicity of a coronavirus, for example an
IBV, include gasping, coughing, sneezing, snicking,
depression, ruffled feathers and loss of tracheal ciliary activity.

The variant replicase of the present invention, when
expressed in a coronavirus, may cause a reduced level of
clinical symptoms compared to a coronavirus expressing a
wild-type replicase.

For example a coronavirus expressing the variant repli- 
case may cause a number of snicks per bird per minute 
which is less than 90%, less than 80%, less than 70%, less 
than 60%, less than 50%, less than 40%, less than 30%, less 
than 20% or less than 10% of the number of snicks caused 
by a virus expressing the wild type replicase. 
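
The comparison described above can be sketched in a few lines of Python. This is a minimal illustration only: the function names and the example rates are hypothetical and are not data from the present study. It expresses the snicking rate seen with a variant virus as a percentage of the rate seen with the wild-type virus and reports which of the listed thresholds are met.

```python
# Illustrative sketch: names and example values are assumptions.
THRESHOLDS = (90, 80, 70, 60, 50, 40, 30, 20, 10)  # percentages listed in the text


def relative_snick_rate(variant_rate: float, wild_type_rate: float) -> float:
    """Return the variant snick rate as a percentage of the wild-type rate.

    Both rates are expressed in snicks per bird per minute.
    """
    if wild_type_rate <= 0:
        raise ValueError("wild-type snick rate must be positive")
    return 100.0 * variant_rate / wild_type_rate


def thresholds_met(percentage: float) -> list:
    """List every threshold T for which the rate is 'less than T%'."""
    return [t for t in THRESHOLDS if percentage < t]


if __name__ == "__main__":
    pct = relative_snick_rate(variant_rate=0.25, wild_type_rate=1.0)
    print(f"{pct:.0f}% of wild-type rate; below thresholds: {thresholds_met(pct)}")
```

A rate exactly equal to a threshold is not counted as met, because the text specifies "less than" each percentage.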

A coronavirus expressing a variant replicase according to
the present invention may cause wheezing in less than 70%,
less than 60%, less than 50%, less than 40%, less than 30%,
less than 20% or less than 10% of the number of birds in a
flock infected with a virus expressing the wild type
replicase.

A coronavirus expressing a variant replicase according to 
the present invention may result in tracheal ciliary activity 
which is at least 60%, at least 70%, at least 80%, at least 90% 
or at least 95% of the level of tracheal ciliary activity in 
uninfected birds. 

A coronavirus expressing a variant replicase according to 
the present invention may cause clinical symptoms, as 
defined in Table 2, at a lower level than a coronavirus 
expressing the wild type replicase. 


TABLE 2
IBV severity limits based on clinical signs

Clinical sign                                   Severity limit

Snicking (sneezing)                             IBV specific: Mild (N.B. Respiratory signs
Nasal exudate                                   become apparent from 2-3 dpi if they are
Watery eyes                                     going to occur and can continue for up to
Swollen infraorbital sinuses                    7 d).
Rales (vibration in trachea or bronchi region)

Hunched posture/depressed                       Mild, if exceed 2 d increase to moderate
Fluffed up feathers

Eating and drinking less                        Mild, if exceed 1 d increase to moderate

Drinking in excess: evident by fluid filled     IBV specific: Mild, if exceed 24 h increase
crop or measured water intake                   to moderate for a max of 2 d. If still
                                                drinking in excess then kill by schedule 1
                                                method.

Less active but still evade capture
Weight loss

Not eating or drinking                          Moderate: birds at end point. Kill by
Birds sit alone and do not evade capture        schedule 1 method.
Severe respiratory distress: e.g. excessive
gasping
Snicking and/or rales for 7 d in total

Found dead                                      Severe: report to project license holder.
                                                Full post-mortem to be performed.


The variant replicase of the present invention, when
expressed in a coronavirus, may cause the virus to replicate
at non-pathogenic levels in ovo.

While developing vaccines to be administered in ovo to
chicken embryos, attention must be paid to two points: the
effect of maternal antibodies on the vaccines and the effect
of the vaccines on the embryo. Maternal antibodies are
known to interfere with active immunization. For example,
vaccines with mild strains do not induce protective antibody
levels when administered to broiler chickens with maternal
antibodies as these strains are neutralized by the maternal
antibody pool.

Thus a viral particle must be sufficiently efficient at
replicating and propagating to ensure that it is not
neutralized by the maternally-derived antibodies against the virus.
Maternally-derived antibodies are a finite pool of effective
antibodies, which decrease as the chicken ages, and
neutralization of the virus in this manner does not equate to the
establishment of long-term immunity for the embryo/chick.
In order to develop long-term immunity against the virus,
the embryo and hatched chicken must develop an appropriate
protective immune response which is distinct from the effect
of the maternally-derived antibodies.

To be useful for in ovo vaccination, the virus must also not
replicate and propagate at a level which causes it to be
pathogenic to the embryo.

Reduced pathogenicity in terms of the embryo may mean
that the coronavirus causes less reduction in hatchability
compared to a corresponding, wild-type control coronavirus.
Thus the term “without being pathogenic to the embryo” in
the context of the present invention may mean “without
causing reduced hatchability” when compared to a control
coronavirus.

A suitable variant replicase may be identified using
methods which are known in the art. For example comparative
challenge experiments following in ovo vaccination of
embryos with or without maternally-derived antibodies may
be performed (i.e. wherein the layer has or has not been
vaccinated against IBV).

If the variant replicase enables the virus to propagate at a 
level which is too high, the embryo will not hatch or will not 
be viable following hatching (i.e. the virus is pathogenic to 
the embryo). A virus which is pathogenic to the embryo may 
kill the embryo. 

If the variant replicase causes a reduction in viral
replication and propagation which is too great, the virus will be
neutralised by the maternally-derived antibodies.
Subsequent challenge of the chick with IBV will therefore result
in the development of clinical symptoms (for example
wheezing, snicking, loss of ciliary activity) and the onset of
disease in the challenged chick, as it will have failed to
develop effective immunity against the virus.

Variant 

As used herein, the term “variant” is synonymous with
“mutant” and refers to a nucleic acid or amino acid sequence
which differs in comparison to the corresponding wild-type
sequence.

A variant/mutant sequence may arise naturally, or may be 
created artificially (for example by site-directed mutagen- 
esis). The mutant may have at least 70, 80, 90, 95, 98 or 99% 
sequence identity with the corresponding portion of the wild 
type sequence. The mutant may have less than 20, 10, 5, 4, 
3, 2 or 1 mutation(s) over the corresponding portion of the 
wild-type sequence. 

The term “wild type” is used to mean a gene or protein
having a nucleotide or amino acid sequence which is
identical with the native gene or protein respectively (i.e. the
viral gene or protein).

Identity comparisons can be conducted by eye, or more
usually, with the aid of readily available sequence
comparison programs. These commercially available computer
programs can calculate % identity between two or more
sequences. A suitable computer program for carrying out
such an alignment is the GCG Wisconsin Bestfit package
(University of Wisconsin, U.S.A.; Devereux et al., 1984,
Nucleic Acids Research 12:387). Examples of other
software that can perform sequence comparisons include, but
are not limited to, the BLAST package (see Ausubel et al.,
1999 ibid, Chapter 18), FASTA (Altschul et al., 1990, J.
Mol. Biol., 403-410), the GENEWORKS suite of
comparison tools and ClustalX (see Larkin et al. (2007) Clustal W
and Clustal X version 2.0. Bioinformatics, 23:2947-2948).
Both BLAST and FASTA are available for offline and online
searching (see Ausubel et al., 1999 ibid, pages 7-58 to 7-60).
However, for some applications, it is preferred to use the
GCG Bestfit program. A new tool, called BLAST 2
Sequences, is also available for comparing protein and
nucleotide sequences (see FEMS Microbiol Lett 1999 174(2):
247-50; FEMS Microbiol Lett 1999 177(1): 187-8 and
tatiana@ncbi.nlm.nih.gov).
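
For two sequences that are already aligned and of equal length, % identity reduces to a position-by-position count. The Python sketch below illustrates only that arithmetic. It is not the Bestfit, BLAST or FASTA algorithm, it performs no alignment, and the function name and toy peptides are hypothetical.

```python
def percent_identity(seq_a: str, seq_b: str) -> tuple:
    """Return (% identity, number of mismatches) for two pre-aligned sequences."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    if not seq_a:
        raise ValueError("sequences must not be empty")
    # Count the positions at which the two sequences differ.
    mismatches = sum(1 for a, b in zip(seq_a, seq_b) if a != b)
    identity = 100.0 * (len(seq_a) - mismatches) / len(seq_a)
    return identity, mismatches


if __name__ == "__main__":
    wild_type = "MKVLAAGIPL"  # toy 10-residue peptide, not taken from any SEQ ID NO
    variant = "MKVLAAGILL"    # single substitution at position 9
    identity, n_mut = percent_identity(wild_type, variant)
    print(f"{identity:.1f}% identity, {n_mut} mutation(s)")
```

Sequences of different length, or containing insertions and deletions, would need to be aligned first with one of the programs named above.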

The sequence may have one or more deletions, insertions 
or substitutions of amino acid residues which produce a 
silent change and result in a functionally equivalent mol- 
ecule. Deliberate amino acid substitutions may be made on 
the basis of similarity in polarity, charge, solubility, hydro- 
phobicity, hydrophilicity, and/or the amphipathic nature of 
the residues as long as the activity is retained. For example, 
negatively charged amino acids include aspartic acid and 
glutamic acid; positively charged amino acids include lysine 
and arginine; and amino acids with uncharged polar head 
groups having similar hydrophilicity values include leucine, 
isoleucine, valine, glycine, alanine, asparagine, glutamine, 
serine, threonine, phenylalanine, and tyrosine. 

Conservative substitutions may be made, for example 
according to the Table below. Amino acids in the same block 
in the second column and preferably in the same line in the 
third column may be substituted for each other: 


ALIPHATIC    Non-polar          G A
                                I L
             Polar-uncharged    C S
                                N Q
AROMATIC                        H F W Y


The coronavirus of the present invention may comprise a
variant replicase gene which encodes a protein which
comprises a mutation compared to any one of SEQ ID NO: 6, 7,
8 or 9 which, when expressed in a coronavirus, causes the
virus to have reduced pathogenicity compared to a
coronavirus expressing the corresponding wild-type replicase.

The variant replicase gene may encode a protein which
comprises one or more amino acid mutations in any
combination of nsp-10, nsp-14, nsp-15 and nsp-16.

The variant replicase gene of the coronavirus of the 
present invention may encode a protein comprising a muta- 
tion as defined in the M41 mod sequences presented in FIG. 
10. 

The variant replicase gene of the coronavirus of the 
present invention may encode a protein which comprises 
one or more amino acid mutations selected from the list of: 

Pro to Leu at position 85 of SEQ ID NO: 6;

Val to Leu at position 393 of SEQ ID NO: 7;

Leu to Ile at position 183 of SEQ ID NO: 8; and

Val to Ile at position 209 of SEQ ID NO: 9.

The variant replicase gene of the coronavirus of the 
present invention may encode a protein which does not 
comprise a mutation in nsp-2, nsp-3, nsp-6 or nsp-13. 

The variant replicase gene of the coronavirus of the
present invention may encode a protein which does not
comprise a mutation in nsp-10 which corresponds to the
threonine to isoleucine mutation caused by a mutation at
nucleotide position 12,008 in the gene reported by
Ammayappan et al. (Arch Virol (2009) 154:495-499).

Ammayappan et al (as above) report the identification of
sequence changes responsible for the attenuation of IBV
strain Arkansas DPI. The study identified 17 amino acid
changes in a variety of IBV proteins following multiple
passages, approx. 100, of the virus in embryonated eggs. It
was not investigated whether the attenuated virus (Ark DPI
101) is capable of replicating in the presence of
maternally-derived antibodies against the virus in ovo, without being
pathogenic to the embryo. Given that this virus was
produced by multiple passage in SPF embryonated eggs, a
methodology similar to that used for classical IBV vaccines,
it is likely that this virus is pathogenic for embryos. The
virus may also be sensitive to maternally-derived antibodies
if the hens were vaccinated with a similar serotype.

The variant replicase gene of the coronavirus of the 
present invention may encode a protein which comprises 
any combination of one or more amino acid mutations 
provided in the list above. 

The variant replicase gene may encode a protein which 
comprises the amino acid mutation Pro to Leu at position 85 
of SEQ ID NO: 6. 

The variant replicase gene may encode a protein which 
comprises the amino acid mutation Val to Leu at position 
393 of SEQ ID NO: 7. 

The variant replicase gene may encode a protein which 
comprises the amino acid mutation Leu to Ile at position 183 
of SEQ ID NO: 8. 

The variant replicase gene may encode a protein which
comprises the amino acid mutation Val to Ile at position 209
of SEQ ID NO: 9.

The variant replicase gene may encode a protein which
comprises the amino acid mutations Pro to Leu at position
85 of SEQ ID NO: 6, and Val to Leu at position 393 of SEQ
ID NO: 7.

The variant replicase gene may encode a protein which
comprises the amino acid mutations Pro to Leu at position
85 of SEQ ID NO: 6 and Leu to Ile at position 183 of SEQ ID
NO: 8.

The variant replicase gene may encode a protein which
comprises the amino acid mutations Pro to Leu at position
85 of SEQ ID NO: 6 and Val to Ile at position 209 of SEQ
ID NO: 9.

The variant replicase gene may encode a protein which
comprises the amino acid mutations Val to Leu at position
393 of SEQ ID NO: 7 and Leu to Ile at position 183 of SEQ
ID NO: 8.

The variant replicase gene may encode a protein which
comprises the amino acid mutations Val to Leu at position
393 of SEQ ID NO: 7 and Val to Ile at position 209 of SEQ
ID NO: 9.

The variant replicase gene may encode a protein which
comprises the amino acid mutations Leu to Ile at position
183 of SEQ ID NO: 8 and Val to Ile at position 209 of SEQ
ID NO: 9.

The variant replicase gene may encode a protein which
comprises the amino acid mutations Pro to Leu at position
85 of SEQ ID NO: 6, Val to Leu at position 393 of SEQ ID
NO: 7 and Leu to Ile at position 183 of SEQ ID NO: 8.

The variant replicase gene may encode a protein which
comprises the amino acid mutations Pro to Leu at position
85 of SEQ ID NO: 6, Leu to Ile at position 183 of SEQ ID
NO: 8 and Val to Ile at position 209 of SEQ ID NO: 9.

The variant replicase gene may encode a protein which
comprises the amino acid mutations Pro to Leu at position
85 of SEQ ID NO: 6, Val to Leu at position 393 of SEQ ID
NO: 7 and Val to Ile at position 209 of SEQ ID NO: 9.

The variant replicase gene may encode a protein which
comprises the amino acid mutations Val to Leu at position
393 of SEQ ID NO: 7, Leu to Ile at position 183 of SEQ ID
NO: 8 and Val to Ile at position 209 of SEQ ID NO: 9.

The variant replicase gene may encode a protein which
comprises the amino acid mutations Pro to Leu at position
85 of SEQ ID NO: 6, Val to Leu at position 393 of SEQ ID
NO: 7, Leu to Ile at position 183 of SEQ ID NO: 8 and Val
to Ile at position 209 of SEQ ID NO: 9.
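
The embodiments enumerated above correspond to every non-empty combination of the four listed amino acid mutations: four single mutations, six pairs, four triples and one quadruple, fifteen in total. The following Python sketch reproduces that enumeration. The shorthand labels such as "Pro85Leu" and the function name are introduced here for illustration only and do not appear in the specification.

```python
from itertools import combinations

# The four amino acid mutations listed in the text.
# The compact labels are an illustrative shorthand, not the patent's notation.
MUTATIONS = (
    "Pro85Leu (SEQ ID NO: 6)",
    "Val393Leu (SEQ ID NO: 7)",
    "Leu183Ile (SEQ ID NO: 8)",
    "Val209Ile (SEQ ID NO: 9)",
)


def all_combinations(items):
    """Yield every non-empty subset of items, smallest subsets first."""
    for size in range(1, len(items) + 1):
        yield from combinations(items, size)


if __name__ == "__main__":
    combos = list(all_combinations(MUTATIONS))
    print(len(combos))  # 4 + 6 + 4 + 1 = 15
    for combo in combos:
        print(" + ".join(combo))
```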

The variant replicase gene may also be defined at the 
nucleotide level. 

For example the nucleotide sequence of the variant
replicase gene of the coronavirus of the present invention may
comprise one or more nucleotide substitutions within the
regions selected from the list of: 11884-12318,
16938-18500, 18501-19514 and 19515-20423 of SEQ ID NO:1.

For example the nucleotide sequence of the variant
replicase gene of the coronavirus of the present invention may
comprise one or more nucleotide substitutions selected from
the list of:

C to T at nucleotide position 12137;

G to C at nucleotide position 18114;

T to A at nucleotide position 19047; and

G to A at nucleotide position 20139;
compared to the sequence shown as SEQ ID NO: 1.
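
The relationship between the nucleotide positions and the amino acid positions given earlier can be checked with simple arithmetic. The Python sketch below rests on two assumptions that are inferred from the text rather than stated in it: that the four listed regions correspond, in order, to the coding sequences of nsp-10, nsp-14, nsp-15 and nsp-16, and that each region begins in frame at its first nucleotide. Under those assumptions the four substitutions fall in codons 85, 393, 183 and 209 respectively, which agrees with the amino acid positions listed for SEQ ID NO: 6 to 9. The function name is hypothetical.

```python
# Regions of SEQ ID NO: 1 listed in the text (1-based, inclusive).
# The nsp labels are an assumption inferred from the order of the list.
REGIONS = {
    "nsp-10": (11884, 12318),
    "nsp-14": (16938, 18500),
    "nsp-15": (18501, 19514),
    "nsp-16": (19515, 20423),
}


def locate(position: int) -> tuple:
    """Map a nucleotide position to (region, codon number, position in codon).

    Assumes each region starts in frame at its first nucleotide.
    Codon number and position in codon are both 1-based.
    """
    for name, (start, end) in REGIONS.items():
        if start <= position <= end:
            offset = position - start
            return name, offset // 3 + 1, offset % 3 + 1
    raise ValueError(f"position {position} is outside the listed regions")


if __name__ == "__main__":
    for pos in (12137, 18114, 19047, 20139):
        region, codon, codon_pos = locate(pos)
        print(f"{pos}: {region}, codon {codon}, codon position {codon_pos}")
```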

As used herein, the term “substitution” is synonymous
with the term mutation and means that the nucleotide at the
identified position differs from that of the wild-type nucleotide
sequence.

The nucleotide sequence may comprise any combination
of the nucleotide substitutions selected from the list of:

C to T at nucleotide position 12137;

G to C at nucleotide position 18114;

T to A at nucleotide position 19047; and

G to A at nucleotide position 20139;
compared to the sequence shown as SEQ ID NO: 1.

The nucleotide sequence may comprise the substitution 
C12137T. 

The nucleotide sequence may comprise the substitution
G18114C.

The nucleotide sequence may comprise the substitution 
T19047A. 

The nucleotide sequence may comprise the substitution 
G20139A. 

The nucleotide sequence may comprise the substitutions
C12137T and G18114C. 

The nucleotide sequence may comprise the substitutions 
C12137T and T19047A. 

The nucleotide sequence may comprise the substitutions 
C12137T and G20139A. 

The nucleotide sequence may comprise the substitutions
G18114C and T19047A. 

The nucleotide sequence may comprise the substitutions 
G18114C and G20139A. 

The nucleotide sequence may comprise the substitutions 
T19047A and G20139A. 

The nucleotide sequence may comprise the substitutions 
C12137T, G18114C and T19047A. 

The nucleotide sequence may comprise the substitutions 
C12137T, T19047A and G20139A. 

The nucleotide sequence may comprise the substitutions 
C12137T, G18114C and G20139A. 

The nucleotide sequence may comprise the substitutions 
G18114C, T19047A and G20139A. 

The nucleotide sequence may comprise the substitutions 
C12137T, G18114C, T19047A and G20139A. 
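
The compact notation used above (wild-type base, 1-based position, replacement base) can be parsed and applied mechanically. The Python sketch below is illustrative only: the function names are hypothetical and the example uses a short toy sequence, since SEQ ID NO: 1 is not reproduced here. It checks that the base found at each position matches the stated wild-type base before replacing it.

```python
import re

# Pattern for codes such as "C12137T": wild-type base, position, new base.
SUBSTITUTION = re.compile(r"([ACGT])(\d+)([ACGT])")


def parse_substitution(code: str) -> tuple:
    """Split a code such as 'C12137T' into (wild-type base, position, new base)."""
    match = SUBSTITUTION.fullmatch(code)
    if match is None:
        raise ValueError(f"not a substitution code: {code!r}")
    old, pos, new = match.groups()
    return old, int(pos), new


def apply_substitutions(sequence: str, codes: list) -> str:
    """Apply substitutions (1-based positions) after checking the wild-type base."""
    bases = list(sequence)
    for code in codes:
        old, pos, new = parse_substitution(code)
        if not 1 <= pos <= len(bases):
            raise ValueError(f"{code}: position outside sequence")
        if bases[pos - 1] != old:
            raise ValueError(f"{code}: expected {old}, found {bases[pos - 1]}")
        bases[pos - 1] = new
    return "".join(bases)


if __name__ == "__main__":
    toy = "ACGTACGT"  # toy sequence, not SEQ ID NO: 1
    print(apply_substitutions(toy, ["C2T", "G7A"]))
```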

The nucleotide sequence may not comprise a substitution 
which corresponds to the C12008T substitution reported by 
Ammayappan et al. (as above). 

The nucleotide sequence may be natural, synthetic or
recombinant. It may be double or single stranded, it may be
DNA or RNA or combinations thereof. It may, for example,
be cDNA, PCR product, genomic sequence or mRNA.

The nucleotide sequence may be codon optimised for
production in the host/host cell of choice.

It may be isolated, or as part of a plasmid, virus or host
cell.

Plasmid 

A plasmid is an extra-chromosomal DNA molecule
separate from the chromosomal DNA which is capable of
replicating independently of the chromosomal DNA. Plasmids
are usually circular and double-stranded.

Plasmids, or vectors (as they are sometimes known), may 
be used to express a protein in a host cell. For example a 
bacterial host cell may be transfected with a plasmid capable 
of encoding a particular protein, in order to express that 
protein. The term also includes yeast artificial chromosomes 
and bacterial artificial chromosomes which are capable of 
accommodating longer portions of DNA. 

The plasmid of the present invention comprises a
nucleotide sequence capable of encoding a defined region of the
replicase protein. It may also comprise one or more
additional coronavirus nucleotide sequence(s), or nucleotide
sequence(s) capable of encoding one or more other
coronavirus proteins such as the S gene and/or gene 3.

The plasmid may also comprise a resistance marker, such
as the guanine xanthine phosphoribosyltransferase gene
(gpt) from Escherichia coli, which confers resistance to
mycophenolic acid (MPA) in the presence of xanthine and
hypoxanthine and is controlled by the vaccinia virus P7.5
early/late promoter.

Recombinant Vaccinia Virus 

The present invention also relates to a recombinant vac- 
cinia virus (rVV) comprising a variant replicase gene as 
defined herein. 

The recombinant vaccinia virus (rVV) may be made using 
a vaccinia-virus based reverse genetics system. 

In this respect, the present invention also provides a 
method for making a viral particle by: 

(i) transfecting a plasmid as described in the previous
section into a host cell;

(ii) infecting the host cell with a recombining virus
comprising the genome of a coronavirus strain with a
replicase gene;

(iii) allowing homologous recombination to occur
between the replicase gene sequences in the plasmid
and the corresponding sequences in the recombining
virus genome to produce a modified replicase gene;

(iv) selecting for recombining virus comprising the
modified replicase gene.

The term “modified replicase gene” refers to a replicase
gene which comprises a variant replicase gene as described
in connection with the first aspect of the present invention.
Specifically, the term refers to a gene which is derived from
a wild-type replicase gene but comprises a nucleotide
sequence which causes it to encode a variant replicase
protein as defined herein.

The recombination may involve all or part of the replicase
gene. For example the recombination may involve a
nucleotide sequence encoding any combination of nsp-10,
nsp-14, nsp-15 and/or nsp-16. The recombination may
involve a nucleotide sequence which encodes an amino
acid mutation or comprises a nucleotide substitution as
defined above.

The genome of the coronavirus strain may lack the part of 
the replicase protein corresponding to the part provided by 
the plasmid, so that a modified protein is formed through 
insertion of the nucleotide sequence provided by the plas- 
mid. 

The recombining virus is one suitable to allow homolo- 
gous recombination between its genome and the plasmid. 
The vaccinia virus is particularly suitable as homologous 
recombination is routinely used to insert and delete 
sequences for the vaccinia virus genome. 

The above method optionally includes the step: 

(v) recovery of recombinant coronavirus comprising the 
modified replicase gene from the DNA from the recom- 
bining virus from step (iv). 

Methods for recovering recombinant coronavirus, such as 
recombinant IBV, are known in the art (See Britton et al 
(2005) see page 24; and PCT/GB2010/001293). 

For example, the DNA from the recombining virus from
step (iv) may be inserted into a plasmid and used to transfect
cells which express cytoplasmic T7 RNA polymerase. The
cells may, for example, be pre-infected with a fowlpox virus
expressing T7 RNA polymerase. Recombinant coronavirus
may then be isolated, for example, from the growth medium.

When the plasmid is inserted into the vaccinia virus 
genome, an unstable intermediate is formed. Recombinants 
comprising the plasmid may be selected for e.g. using a 
resistance marker on the plasmid. 

Positive recombinants may then be verified to contain the
modified replicase gene by, for example, PCR and
sequencing.

Large stocks of the recombining virus including the
modified replicase gene (e.g. recombinant vaccinia virus,
rVV) may be grown up and the DNA extracted in order to
carry out step (v).

Suitable reverse genetics systems are known in the art 
(Casais et al (2001) J. Virol 75:12359-12369; Casais et al 
(2003) J. Virol. 77:9084-9089; Britton et al (2005) J. Viro- 
logical Methods 123:203-211; Armesto et al (2008) Methods 
in Molecular Biology 454:255-273). 

Cell 

The coronavirus may be used to infect a cell. 

Coronavirus particles may be harvested, for example from 
the supernatant, by methods known in the art, and optionally 
purified. 

The cell may be used to produce the coronavirus particle. 

Thus the present invention also provides a method for 
producing a coronavirus which comprises the following 
steps: 

(i) infection of a cell with a coronavirus according to the
invention;

(ii) allowing the virus to replicate in the cell; and

(iii) harvesting the progeny virus.

The present invention also provides a cell capable of 
producing a coronavirus according to the invention using a 
reverse genetics system. For example, the cell may comprise 
a recombining virus genome comprising a nucleotide 
sequence capable of encoding the replicase gene of the 
present invention. 

The cell may be able to produce recombinant recombining 
virus (e.g. vaccinia virus) containing the replicase gene. 

Alternatively the cell may be capable of producing recom- 
binant coronavirus by a reverse genetics system. The cell 
may express or be induced to express T7 polymerase in 
order to rescue the recombinant viral particle. 

Vaccine 

The coronavirus may be used to produce a vaccine. The
vaccine may be a live attenuated form of the coronavirus of
the present invention and may further comprise a
pharmaceutically acceptable carrier. As defined herein,
“pharmaceutically acceptable carriers” suitable for use in the
invention are well known to those of skill in the art. Such carriers
include, without limitation, water, saline, buffered saline,
phosphate buffer, alcohol/aqueous solutions, emulsions or
suspensions. Other conventionally employed diluents and
excipients may be added in accordance with conventional
techniques. Such carriers can include ethanol, polyols, and
suitable mixtures thereof, vegetable oils, and injectable
organic esters. Buffers and pH adjusting agents may also be
employed. Buffers include, without limitation, salts prepared
from an organic acid or base. Representative buffers include,
without limitation, organic acid salts, such as salts of citric
acid, e.g., citrates, ascorbic acid, gluconic acid,
histidine-HCl, carbonic acid, tartaric acid, succinic acid, acetic acid, or
phthalic acid, Tris, tromethamine hydrochloride, or
phosphate buffers. Parenteral carriers can include sodium
chloride solution, Ringer’s dextrose, dextrose, trehalose,
sucrose, and sodium chloride, lactated Ringer’s or fixed oils.
Intravenous carriers can include fluid and nutrient
replenishers, electrolyte replenishers, such as those based on
Ringer’s dextrose and the like. Preservatives and other
additives such as, for example, antimicrobials, antioxidants,
chelating agents (e.g., EDTA), inert gases and the like may
also be provided in the pharmaceutical carriers. The present
invention is not limited by the selection of the carrier. The
preparation of these pharmaceutically acceptable
compositions, from the above-described components, having
appropriate pH, isotonicity, stability and other conventional
characteristics is within the skill of the art. See, e.g., texts such
as Remington: The Science and Practice of Pharmacy, 20th
ed, Lippincott Williams & Wilkins, publ., 2000; and The
Handbook of Pharmaceutical Excipients, 4th edit., eds.
R. C. Rowe et al, APhA Publications, 2003.

The vaccine of the invention will be administered in a
“therapeutically effective amount”, which refers to an
amount of an active ingredient, e.g., an agent according to
the invention, sufficient to effect beneficial or desired results
when administered to a subject or patient. An effective
amount can be administered in one or more administrations,
applications or dosages. A therapeutically effective amount
of a composition according to the invention may be readily
determined by one of ordinary skill in the art. In the context
of this invention, a “therapeutically effective amount” is one
that produces an objectively measured change in one or
more parameters associated with the Infectious Bronchitis
condition sufficient to effect beneficial or desired results.
For purposes of this invention, an effective amount of drug,
compound, or pharmaceutical composition is an amount
sufficient to reduce the incidence of Infectious Bronchitis.
As used herein, the term “therapeutic” encompasses the full
spectrum of treatments for a disease, condition or disorder.
A “therapeutic” agent of the invention may act in a manner
that is prophylactic or preventive, including those that
incorporate procedures designed to target animals that can
be identified as being at risk (pharmacogenetics); or in a
manner that is ameliorative or curative in nature; or may act
to slow the rate or extent of the progression of at least one
symptom of a disease or disorder being treated.

The present invention also relates to a method for pro- 
ducing such a vaccine which comprises the step of infecting 
cells, for example Vero cells, with a viral particle comprising 
a replicase protein as defined in connection with the first 
aspect of the invention. 

Vaccination Method 

The coronavirus of the present invention may be used to 
treat and/or prevent a disease. 

To “treat” means to administer the vaccine to a subject 
having an existing disease in order to lessen, reduce or 
improve at least one symptom associated with the disease 
and/or to slow down, reduce or block the progression of the 
disease. 

To “prevent” means to administer the vaccine to a subject 
who has not yet contracted the disease and/or who is not 
showing any symptoms of the disease to prevent or impair 
the cause of the disease (e.g. infection) or to reduce or 
prevent development of at least one symptom associated 
with the disease. 

The disease may be any disease caused by a coronavirus,
such as a respiratory disease and/or gastroenteritis in
humans and hepatitis, gastroenteritis, encephalitis, or a
respiratory disease in other animals.

The disease may be infectious bronchitis (IB); Porcine 
epidemic diarrhoea; Transmissible gastroenteritis; Mouse 
hepatitis virus; Porcine haemagglutinating encephalomyeli- 
tis; Severe acute respiratory syndrome (SARS); or Blue- 
comb disease. 

The disease may be infectious bronchitis. 

The vaccine may be administered to hatched chicks or
chickens, for example by eye drop or intranasal
administration. Although accurate, these methods can be expensive e.g.
for large broiler flocks. Alternatives include spray
inoculation or administration via drinking water but it can be difficult
to ensure uniform vaccine application using such methods.

The vaccine may be provided in a form suitable for its
administration, such as an eye-dropper for intra-ocular use.

The vaccine may be administered by in ovo inoculation, 
for example by injection of embryonated eggs. In ovo 
vaccination has the advantage that it provides an early stage 
resistance to the disease. It also facilitates the administration 
of a uniform dose per subject, unlike spray inoculation and 
administration via drinking water. 

The vaccine may be administered to any suitable com- 
partment of the egg, including allantoic fluid, yolk sac, 
amnion, air cell or embryo. It may be administered below the 
shell (aircell) membrane and chorioallantoic membrane. 

Usually the vaccine is injected into embryonated eggs
during late stages of embryonic development, generally
during the final quarter of the incubation period, such as 3-4
days prior to hatch. In chickens, the vaccine may be
administered between days 15 and 19 of the 21-day incubation
period, for example at day 17 or 18.
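
The timing figures above are related by simple arithmetic, sketched below in Python. The function names are hypothetical. For a 21-day incubation period the final quarter begins at day 15.75, and injection at day 17 or 18 corresponds to 4 or 3 days prior to hatch.

```python
INCUBATION_DAYS = 21  # chicken incubation period stated in the text


def final_quarter_start(incubation_days: int = INCUBATION_DAYS) -> float:
    """Day on which the final quarter of the incubation period begins."""
    return incubation_days * 0.75


def days_before_hatch(day: int, incubation_days: int = INCUBATION_DAYS) -> int:
    """Number of days between an injection day and hatch."""
    return incubation_days - day


if __name__ == "__main__":
    print(final_quarter_start())
    for day in (17, 18):
        print(day, days_before_hatch(day))
```

The lower end of the stated 15-19 day window falls slightly before the start of the final quarter, which is consistent with the text describing the final quarter as the general rather than the exclusive period.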

The process can be automated using a robotic injection 
process, such as those described in WO 2004/078203. 

The vaccine may be administered together with one or 
more other vaccines, for example, vaccines for other dis- 
eases, such as Newcastle disease virus (NDV). The present 
invention also provides a vaccine composition comprising a 
vaccine according to the invention together with one or more 
other vaccine(s). The present invention also provides a kit 
comprising a vaccine according to the invention together 
with one or more other vaccine(s) for separate, sequential or 
simultaneous administration. 

The vaccine or vaccine composition of the invention may 
be used to treat a human, animal or avian subject. For 
example, the subject may be a chick, chicken or mouse (such 
as a laboratory mouse, e.g. transgenic mouse). 

Typically, a physician or veterinarian will determine the 
actual dosage which will be most suitable for an individual 
subject or group of subjects and it will vary with the age, 
weight and response of the particular subject(s). 

The composition may optionally comprise a pharmaceu- 
tically acceptable carrier, diluent, excipient or adjuvant. The 
choice of pharmaceutical carrier, excipient or diluent can be 
selected with regard to the intended route of administration 
and standard pharmaceutical practice. The pharmaceutical 
compositions may comprise as (or in addition to) the carrier, 
excipient or diluent, any suitable binder(s), lubricant(s), 
suspending agent(s), coating agent(s), solubilising agent(s), 
and other carrier agents that may aid or increase the delivery 
or immunogenicity of the virus. 

The invention will now be further described by way of 
Examples, which are meant to serve to assist one of ordinary 
skill in the art in carrying out the invention and are not 
intended in any way to limit the scope of the invention. 


EXAMPLES 


Example 1—Generation of an IBV Reverse 
Genetics System Based on M41-CK 


An M41-CK full-length cDNA was produced by replacement 
of the Beaudette cDNA in the Vaccinia virus reverse 
genetics system previously described in PCT/GB2010/001293 
(herein incorporated by reference) with synthetic 
cDNA derived from the M41 consensus sequence. 

The IBV cDNA within the recombinant Vaccinia virus (rVV) 
rVV-BeauR-Rep-M41, the structure of which is described in 
Armesto, Cavanagh and Britton (2009). PLoS ONE 4(10): e7384. 
doi:10.1371/journal.pone.0007384, and which consisted of the 
replicase derived from IBV Beaudette strain and the 
structural and accessory genes and 3' UTR from IBV M41-CK, 
was further modified by replacement of the Beaudette 5' 
UTR-Nsp2-Nsp3 sequence with the corresponding sequence 
from IBV M41-CK. The resulting IBV cDNA consisted of 
5' UTR-Nsp2-Nsp3 from M41, Nsp4-Nsp16 from Beaudette 
and the structural and accessory genes and 3' UTR from 
M41. This cDNA was further modified by the deletion of the 
Beaudette Nsp4-Nsp16 sequence. The resulting cDNA, 
lacking Nsp4-16, was modified in four further steps in which 
the deleted Nsps were sequentially replaced with the 
corresponding sequences from M41-CK; the replacement cDNAs 
represented M41-CK Nsp4-8, Nsp9-12, Nsp12-14 and 
finally Nsp15-16. Each replacement cDNA contained 
approx. 500 nucleotides at the 5' end corresponding to the 
3'-most M41 sequence previously inserted and approx. 500 
nucleotides at the 3' end corresponding to the M41 S gene 
sequence. This allowed insertion of the M41 cDNA 
sequence by homologous recombination and sequential 
addition of contiguous M41 replicase gene sequence. The 
synthetic cDNAs containing the M41-derived Nsp 
sequences were added by homologous recombination 
utilising the inventors' previously described transient 
dominant selection (TDS) system (see PCT/GB2010/001293). The 
M41-derived cDNAs containing sequence corresponding to 
the M41 Nsp-10, -14, -15 and -16 contained the modified 
amino acids at positions 85, 393, 183 and 209, respectively, 
as indicated in FIG. 10. 

A full-length cDNA representing the genome of M41-CK 
was generated in Vaccinia virus representing the synthetic 
sequences. Two rIBVs, M41-R-6 and M41-R-12, were res- 
cued and shown to grow in a similar manner as M41-CK 
(FIG. 1). 


Example 2—Determining the Pathogenicity of 
Rescued M41 Viruses 


The viruses rescued in Example 1 were used to infect 
8-day-old specific pathogen free (SPF) chicks by ocular and 
nasal inoculation to test them for pathogenicity, as observed 
by clinical signs on a daily basis 3-7 days post-infection and 
by ciliary activity on days 4 and 6 post-infection. Loss of 
ciliary activity is a well-established measure of 
the pathogenicity of IBV. The two M41-R viruses were 
found to be apathogenic when compared to M41-CK, though 
they did show some clinical signs in comparison to 
uninfected control chicks (FIG. 2) and some but inconsistent 
loss in ciliary activity (FIG. 3). 

Thus, the M41-R molecular clones of M41-CK were not 
pathogenic when compared to the parental virus M41-CK. 

The inventors identified several nucleotide differences in 
the M41-R compared to the M41-CK sequences. The 
majority of these were synonymous mutations, as the nucleotide 
change did not affect the amino acid sequence of the protein 
associated with the sequence. However, four non-synonymous 
mutations were identified in the IBV replicase gene, 
specific to the Nsp-10, Nsp-14, Nsp-15 and Nsp-16 components 
of the replicase gene. These mutations resulted in amino acid 
changes (Table 3). 


TABLE 3 


Non-Synonymous mutations identified in the Nsps of M41-R 
full-length genome 


Region of    Nucleotide   Nucleotide 
Replicase    position     Mutation     Amino Acid Change 
Nsp10        12137        C->T         Pro->Leu 
Nsp14        18114        G->C         Val->Leu 
Nsp15        19047        T->A         Leu->Ile 
Nsp16        20139        G->A         Val->Ile 
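

The distinction between synonymous and non-synonymous changes drawn above can be illustrated with the standard genetic code. The sketch below is illustrative only: the codons passed to the function are hypothetical examples chosen to reproduce the amino acid changes of Table 3, and are not taken from the sequence listing; the function name is likewise an assumption.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def classify_substitution(codon: str, index: int, new_base: str):
    """Return (amino acid before, amino acid after, kind) for one substitution.

    index is the 0-based position of the changed base within the codon.
    """
    codon = codon.upper()
    mutated = codon[:index] + new_base.upper() + codon[index + 1:]
    before, after = CODON_TABLE[codon], CODON_TABLE[mutated]
    kind = "synonymous" if before == after else "non-synonymous"
    return before, after, kind


# Hypothetical codons reproducing the changes listed in Table 3.
print(classify_substitution("CCT", 1, "T"))  # C->T, Pro to Leu
print(classify_substitution("GTT", 0, "C"))  # G->C, Val to Leu
print(classify_substitution("TTA", 0, "A"))  # T->A, Leu to Ile
print(classify_substitution("GTT", 0, "A"))  # G->A, Val to Ile
```

A third-position change such as CTT to CTC leaves leucine unchanged and is therefore classified as synonymous, which is the case for the majority of the differences mentioned above.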


Example 3—Repair of M41-R rIBVs 


In order to determine whether the identified mutations 
were responsible for the loss of pathogenicity associated 
with M41-R, the Nsp10 mutation was repaired in one rIBV and 
the mutations in Nsp-14, -15 & -16 were repaired in another. 
The inventors thus generated the rIBVs M41R-nsp10rep and 
M41R-nsp14, 15, 16rep, using synthetic cDNAs containing the 
correct nucleotides and utilising the inventors' previously 
described TDS system (see PCT/GB2010/001293). Both rIBVs 
were shown to grow in a similar manner to M41-CK (FIG. 9). 

The rIBVs were assessed for pathogenicity in chicks as 
described previously. Both rIBVs showed increased 
pathogenicity when compared to M41-R but not to the level 
observed with M41-CK (FIGS. 4 and 5). M41R-nsp14, 15, 
16rep gave more clinical signs and more reduction in ciliary 
activity than M41R-nsp10rep. Overall, these results indicated 
that the changes associated with the four Nsps appear to 
affect pathogenicity. 

To determine the roles of the Nsps in pathogenicity the 
full-length cDNA corresponding to M41R-nsp10rep was 
used to repair the mutations in Nsps14, 15 & 16 using a 
synthetic cDNA containing the correct nucleotides utilising 
the TDS system. 

The following rIBVs were produced: 

M41R-nsp10, 15rep: M41-R with the mutations in Nsp-10 
and Nsp-15 repaired 

M41R-nsp10, 14, 15rep: M41-R with mutations in Nsp-10, 
-14 and -15 repaired 

M41R-nsp10, 14, 16rep: M41-R with mutations in Nsp-10, 
-14 and -16 repaired 

M41R-nsp10, 15, 16rep: M41-R with mutations in Nsp-10, 
-15 and -16 repaired 

M41-K: all four mutations, Nsp-10, -14, -15 & -16, 
repaired in M41-R 

The rIBVs were shown to grow in a similar manner as 
M41-CK (FIG. 9) and assessed for pathogenicity as 
described previously. M41-K (in which all four mutations 
had been repaired) resulted in clinical signs and 100% loss 
of ciliary activity (complete ciliostasis) by 4 days post- 
infection (FIGS. 6, 7 & 8). The other rIBVs demonstrated 
varying levels of pathogenicity, apart from M41R-nsp10, 15, 
16rep, which was essentially apathogenic. These results 
confirmed that repair of all four Nsps restored pathogenicity 
to M41-R; again supporting the previous evidence that the 
mutations described in the four Nsps are implicated in 
attenuating M41-CK. 

The inventors also generated rIBV M41R-nsp 10, 14 rep 
(nsp 10 and 14 are repaired, nsp 15 and 16 contain muta- 
tions) and rIBV M41R-nsp 10, 16 rep (nsp 10 and 16 are 
repaired, nsp 14 and 15 contain mutations) and assessed the 
pathogenicity of these viruses. 

rIBV M41R-nsp 10, 14 rep was less pathogenic than M41-K 
but caused around 50% ciliostasis on days 4-6 
post-infection. rIBV M41R-nsp 10, 16 rep was almost apathogenic and 
caused no ciliostasis (see FIG. 11a-c). 

Thus the genome associated with M41-R is a potential 
backbone genome for a rationally attenuated IBV. 
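
The repaired and unrepaired Nsps of each rIBV named in this Example can be kept straight with simple set arithmetic. The following is a minimal bookkeeping sketch; the virus names are those used above (written without internal spaces), while the dictionary layout and function name are hypothetical.

```python
# The four Nsps carrying non-synonymous mutations in M41-R (Table 3).
MUTATED_IN_M41R = frozenset({"Nsp10", "Nsp14", "Nsp15", "Nsp16"})

# Nsps repaired in each rIBV described in Example 3.
REPAIRED = {
    "M41-R": set(),
    "M41R-nsp10rep": {"Nsp10"},
    "M41R-nsp14,15,16rep": {"Nsp14", "Nsp15", "Nsp16"},
    "M41R-nsp10,15rep": {"Nsp10", "Nsp15"},
    "M41R-nsp10,14rep": {"Nsp10", "Nsp14"},
    "M41R-nsp10,16rep": {"Nsp10", "Nsp16"},
    "M41R-nsp10,14,15rep": {"Nsp10", "Nsp14", "Nsp15"},
    "M41R-nsp10,14,16rep": {"Nsp10", "Nsp14", "Nsp16"},
    "M41R-nsp10,15,16rep": {"Nsp10", "Nsp15", "Nsp16"},
    "M41-K": set(MUTATED_IN_M41R),
}


def remaining_mutations(virus: str) -> set:
    """Nsps that still carry the M41-R mutation in the named rIBV."""
    return set(MUTATED_IN_M41R) - REPAIRED[virus]


for name in REPAIRED:
    print(name, sorted(remaining_mutations(name)))
```

For example, M41R-nsp10,14rep retains the mutations in Nsp15 and Nsp16, whereas M41-K retains none.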


Example 4—Vaccination/Challenge Study with 
M41-R 


Candidate vaccine viruses were tested in studies in which 
fertilized chicken eggs were vaccinated in ovo at 18 days 
embryonation and in which the hatchability of the inoculated 
eggs was determined. The clinical health of the chickens was 
investigated and the chickens were challenged at 21 days of 
age with a virulent IB M41 challenge virus at 10% EID; 
per dose. 

After challenge, clinical signs were investigated to assess 
protection by the vaccine, and a ciliostasis test was performed 
5 days after challenge to investigate the effect of the challenge 
virus on movement of the cilia and protection by the 
vaccine against ciliostasis (inhibition of cilia movement). 

In Ovo Vaccination in Commercial Broiler Eggs 

The design of the experiment is given in Table 4 and the 
clinical results are given in Table 5. Hatchability of the eggs 
inoculated with IB M41-R was good and chickens were 
healthy. IB M41-R protected against clinical signs after 
challenge in the broilers (placebo: 19/19 affected; IB M41-R: 
3/18 affected and 1 dead). The results of the ciliostasis 
test are given in Table 6. IB M41-R generated protection 
against ciliostasis. 


TABLE 4 


Design of a hatchability, safety, efficacy study in commercial eggs 


Treatment  Treatment    EID50(1)  Route of  Day(s) of     Day(s) of           End of      Nr. of eggs 
           Description  per dose  Admin     Admin         Challenge(2)        Study       per treatment 
T01        None         NA        NA        NA            NA                  NA          30 
T02        IB M41-R     10^4      In ovo    18 days       At 21 days of age,  At 26 days  30 
                                            embryonation  20 chickens per     of age 
                                                          group 
NTX        Saline       NA        In ovo    18 days       At 21 days of age,  At 26 days  30 
                                            embryonation  20 chickens per     of age 
                                                          group 

(1) Dose volume 0.1 ml. NA, not applicable. 
(2) 10^3.65 EID50 per dose. 


TABLE 5 


Hatch percentages and clinical data before and after 
challenge in commercial chickens, for design see Table 4. 


                              Before challenge       After challenge 
           Hatch/   Vital/    Deaths/   Symptoms/    Deaths/   Symptoms/ 
Treatment  total    total     total     total        total     total 
None       28/30    Euthanized directly after hatch for blood collection 
IB M41-R   28/30    28/28     1/20      0/19         1/19      3/18 (5, 7) 
Saline     29/30    29/29     1/20      0/19         0/19      19/19 (1, 2, 3, 4, 5, 6, 7) 

(1) Disturbed respiratory system 
(2) Wheezing 
(3) Change of voice 
(4) Breathing difficult 
(5) Swollen intra-orbital sinuses 
(6) Uneven growth 
(7) Weak 


TABLE 6 


Results of the ciliostasis test after challenge, for design see Table 4. 


Treatment   Protected/total   Percentage protection 
Saline      0/19              0% 
IB M41-R    5/18              28% 


In Ovo Vaccination in Specific Pathogen-Free (SPF) Eggs 


The design of the study in SPF eggs is given in Table 7 
and is similar to the design of the studies with commercial 
broilers, but the vaccination dose for IB M41-R was higher 
(10^5 EID50 per dose). 

The results (Table 8) show that the hatch percentage for 
IB M41-R was low: 19 of 40 eggs hatched and the 
chicks were weak. Eight chicks died. The remaining 11 
chickens were challenged, as were 11 of the chicks hatched from 
the eggs which had been inoculated with saline. 

In the ciliostasis test after challenge it appeared that all 
chickens vaccinated in ovo with IB M41-R were protected, 
whereas none of the controls was protected, see Table 9. 


TABLE 7 


Design of a hatchability, safety, efficacy study in SPF eggs 


Treatment  Treatment    EID50(1)  Route of  Day of        Day of        End of      Nr. of eggs 
           Description  per dose  Admin     Admin         Challenge(2)  Study       per treatment 
T01        IB M41-R     10^5      In ovo    18 days       At 21 days    At 26 days  40 
                                            embryonation  of age        of age 
T04        Saline       NA        In ovo    18 days       At 21 days    At 26 days  40 
                                            embryonation  of age        of age 
NTX        NA           NA        NA        NA                                      10 

(1) Dose volume 0.1 ml. NA, not applicable. 
(2) Challenge dose 10^3.3 EID50 in 0.2 ml. 


TABLE 8 


Hatch percentages and clinical data before and after 
challenge in SPF chickens, for design see Table 7. 


                              Before challenge       After challenge 
           Hatch/   Vital/    Deaths/   Symptoms/    Deaths/   Symptoms/ 
Treatment  total    total     total     total        total     total 
IB M41-R   19/40    11/40     8/40      weak         0         0 
Saline     30/40    30/40     0         -            0         0 
NA         9/10     9/10      0         -            -         - 

TABLE 9 


Results of the ciliostasis test after challenge, for design see Table 7. 


Treatment   Protected/total   Percentage protection 
Saline      0/11              0% 
IB M41-R    11/11             100% 


In conclusion, IB M41-R was safe in commercial eggs and 
generated protection against clinical signs and, to an extent, 
against ciliostasis. 

In SPF eggs vaccinated with IB M41-R a relatively low 
number of chickens hatched. This may be due to the 10^5 
EID50 per egg of IB M41-R used. This was 10-fold higher 
than the dose used in earlier studies in which there was a 
higher level of hatchability. The lower hatch percentages 
may also be caused by a particularly high susceptibility of 
the batch of SPF eggs for viruses, as in other studies the level 
of embryo mortality was also higher than had previously 
been observed. 

After challenge, all chickens that survived after hatch were 
completely protected against ciliostasis. It is concluded that 
IB M41-R has great potential as a vaccine to be administered 
in ovo. 
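
The protection and hatch percentages reported in this Example follow directly from the counts in the tables. The sketch below simply recomputes them; the counts are those given in Tables 5, 6, 8 and 9, and the helper name is hypothetical.

```python
def percent(part: int, total: int) -> float:
    """Return part/total expressed as a percentage."""
    if total <= 0:
        raise ValueError("total must be positive")
    return 100.0 * part / total


# Ciliostasis test: (protected, total) per treatment.
ciliostasis = {
    "commercial, Saline": (0, 19),
    "commercial, IB M41-R": (5, 18),
    "SPF, Saline": (0, 11),
    "SPF, IB M41-R": (11, 11),
}
for label, (protected, total) in ciliostasis.items():
    print(f"{label}: {percent(protected, total):.0f}% protected")

# Hatchability: (hatched, eggs) per treatment.
hatch = {
    "commercial, IB M41-R": (28, 30),
    "commercial, Saline": (29, 30),
    "SPF, IB M41-R": (19, 40),
    "SPF, Saline": (30, 40),
}
for label, (hatched, eggs) in hatch.items():
    print(f"{label}: {percent(hatched, eggs):.1f}% hatched")
```

Rounded to whole numbers, 5/18 gives the 28% protection reported for commercial eggs, and 19/40 corresponds to a hatch percentage of 47.5% in the SPF study.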

All publications mentioned in the above specification are 
herein incorporated by reference. Various modifications and 
variations of the described methods and system of the 
invention will be apparent to those skilled in the art without 
departing from the scope and spirit of the invention. 
Although the invention has been described in connection 
with specific preferred embodiments, it should be under- 
stood that the invention as claimed should not be unduly 
limited to such specific embodiments. Indeed, various modi- 
fications of the described modes for carrying out the inven- 
tion which are obvious to those skilled in molecular biology, 
virology or related fields are intended to be within the scope 
of the following claims. 
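
For readers working with the sequence listing that follows, the rows are laid out as six groups of ten nucleotides followed by a running position count. The sketch below, with hypothetical function names, shows one way such rows could be joined into a contiguous sequence and queried by 1-based nucleotide position; the two sample rows are the first two rows of SEQ ID NO 1.

```python
def parse_listing_rows(rows):
    """Join listing rows into one sequence, checking the running count."""
    parts = []
    length = 0
    for row in rows:
        *groups, count = row.split()
        parts.extend(groups)
        length += sum(len(group) for group in groups)
        if length != int(count):
            raise ValueError(f"row ending at {count} has running length {length}")
    return "".join(parts)


def base_at(sequence: str, position: int) -> str:
    """Return the nucleotide at a 1-based position."""
    if not 1 <= position <= len(sequence):
        raise IndexError("position outside sequence")
    return sequence[position - 1]


rows = [
    "acttaagata gatattaata tatatctatc acactagcct tgcgctagat ttccaactta 60",
    "acaaaacgga cttaaatacc tacagctggt cctcataggt gttccattgc agtgcacttt 120",
]
sequence = parse_listing_rows(rows)
print(len(sequence), base_at(sequence, 1), base_at(sequence, 120))
```

The running-count check is a design choice: it flags rows in which a group has been lost or duplicated before any position lookup is attempted.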





SEQUENCE LISTING 


<160> NUMBER OF SEQ ID NOS: 13 

<210> SEQ ID NO 1 
<211> LENGTH: 27500 
<212> TYPE: DNA 
<213> ORGANISM: Infectious bronchitis virus 

<400> SEQUENCE: 1 


acttaagata gatattaata tatatctatc acactagcct tgcgctagat ttccaactta 60 
acaaaacgga cttaaatacc tacagctggt cctcataggt gttccattgc agtgcacttt 120 
agtgccctgg atggcacctg gccacctgtc aggtttttgt tattaaaatc ttattgttgc 180 
tggtatcact gcttgttttg ccgtgtctca ctttatacat ccgttgcttg ggctacctag 240 
tatccagcgt cctacgggcg ccgtggctgg ttcgagtgcg aagaacctct ggttcatcta 300 
gcggtaggcg ggtgtgtgga agtagcactt cagacgtacc ggttctgttg tgtgaaatac 360 
ggggtcacct ccccccacac acctctaagg gcttttgagc ctagcgttgg gctacgttct 420 
cgcataaggt cggctatacg acgtttgtag ggggtagtgc caaacaaccc ctgaggtgac 480 
aggttctggt ggtgtttagt gagcagacat acaatagaca gtgacaacat ggcttcaagc 540 
ctaaaacagg gagtatctcc caaactaagg gatgtcattc ttgtatccaa agacattcct 600 
gaacaacttt gtgacgcttt gtttttctat acgtcacaca accctaagga ttacgctgat 660 
gcttttgcag ttaggcagaa gtttgatcgt aatctgcaga ctgggaaaca gttcaaattt 720 
gaaactgtgt gtggtctctt cctcttgaag ggagttgaca aaataacacc tggcgtccca 780 
gcaaaagtct taaaagccac ttctaagttg gcagatttag aagacatctt tggtgtctct 840 
ccctttgcaa gaaaatatcg tgaacttttg aagacagcat gccagtggtc tcttactgta 900 
gaaacactgg atgctcgtgc acaaactctt gatgaaattt ttgaccctac tgaaatactt 960 
tggcttcagg tggcagcaaa aatccaagtt tcggctatgg cgatgcgcag gcttgttgga 1020 
gaagtaactg caaaagtcat ggatgctttg ggctcaaata tgagtgctct tttccagatt 1080 
tttaaacaac aaatagtcag aatttttcaa aaagcgctgg ctatttttga gaatgtgagt 1140 
gaattaccac agcgtattgc agcacttaag atggcttttg ctaagtgtgc caagtccatt 1200 
actgttgtgg ttatggagag gactctagtt gttagagagt tcgcaggaac ttgtcttgca 1260 
agcattaatg gtgctgttgc aaaattcttt gaagaactcc caaatggttt catgggtgct 1320 
aaaattttca ctacacttgc cttctttagg gaggctgcag tgaaaattgt ggataacata 1380 
ccaaacacac cgagaggcac taaagggttt gaagtcgttg gtaatgccaa aggtacacaa 1440 
gttgttgtgc gtggcatgcg aaatgactta acactgcttg accaaaaagc tgaaattcct 1500 
gtggagtcag aaggttggtc tgcaattttg ggtggacatc tttgctatgt ctttaagagt 1560 
ggtgatcgct tttacgcggc acctctttca ggaaattttg cattgcatga tgtgcattgt 1620 
tgtgagcgtg ttgtctgtct ttctgatggt gtaacaccgg agataaatga tggacttatt 1680 
cttgcagcaa tctactcttc ttttagtgtc gcagaacttg tggcagccat taaaaggggt 1740 
gaaccattta agtttctggg tcataaattt gtgtatgcaa aggatgcagc agtttctttt 1800 
acattagcga aggctgctac tattgcagat gttttgaagc tgtttcaatc agcgcgtgtg 1860 
aaagtagaag atgtttggtc ttcacttact gaaaagtctt ttgaattctg gaggcttgca 1920 
tatggaaaag tgcgtaatct cgaagaattt gttaagactt gtttttgtaa ggctcaaatg 1980 
gcgattgtga ttttagcgac agtgcttgga gagggcattt ggcatcttgt ttcgcaagtc 2040 
atctataaag taggtggtct ttttactaaa gttgttgact tttgtgaaaa atattggaaa 2100 
ggtttttgtg cacagttgaa aagagctaag ctcattgtca ctgaaaccct ctgtgttttg 2160 
aaaggagttg cacagcattg ttttcaacta ttgctggatg caatacagtt tatgtataaa 2220 
agttttaaga agtgtgcact tggtagaatc catggagact tgctcttctg gaaaggaggt 2280 
gtgcacaaaa ttattcaaga gggcgatgaa atttggtttg acgccattga tagtattgat 2340 
gttgaagatc tgggtgttgt tcaagaaaaa ttgattgatt ttgatgtttg tgataatgtg 2400 
acacttccag agaaccaacc cggtcatatg gttcaaatcg aggatgacgg aaagaactac 2460 
atgttcttcc gcttcaaaaa ggatgagaac atttattata caccaatgtc acagcttggt 2520 
gctattaatg tggtttgcaa agcaggcggt aaaactgtca cctttggaga aactactgtg 2580 
caagaaatac caccacctga tgttgtgttt attaaggtta gcattgagtg ttgtggtgaa 2640 
ccatggaata caatcttcaa aaaggcttat aaggagccca ttgaagtaga gacagacctc 2700 
acagttgaac aattgctctc tgtggtctat gagaaaatgt gtgatgatct caagctgttt 2760 
ccggaggctc cagaaccacc accatttgag aatgtcacac ttgttgataa gaatggtaaa 2820 
gatttggatt gcataaaatc atgccatctg atctatcgtg attatgagag cgatgatgac 2880 
atcgaggaag aagatgcaga agaatgtgac acggattcag gtgatgctga ggagtgtgac 2940 
actaattcag aatgtgaaga agaagatgag gatactaaag tgttggctct tatacaagac 3000 
ccggcaagta acaaatatcc tctgcctctt gatgatgatt atagcgtcta caatggatgt 3060 
attgttcata aggacgctct cgatgttgtg aatttaccat ctggtgaaga aacctttgtt 3120 
gtcaataact gctttgaagg ggctgttaaa gctcttccgc agaaagttat tgatgttcta 3180 
ggtgactggg gtgaggctgt tgatgcgcaa gaacaattgt gtcaacaaga atcaactcgg 3240 
gtcatatctg agaaatcagt tgagggtttt actggtagtt gtgatgcaat ggctgaacaa 3300 
gctattgttg aagagcagga aatagtacct gttgttgaac aaagtcagga tgtagttgtt 3360 
tttacacctg cagacctaga agttgttaaa gaaacagcag aagaggttga tgagtttatt 3420 
ctcatttctg ctgtccctaa agaagaagtt gtgtctcagg agaaagagga gccacaggtt 3480 
gagcaagagc ctaccctagt tgttaaagca caacgtgaga agaaggctaa aaagttcaaa 3540 
gttaaaccag ctacatgtga aaaacccaaa tttttggagt acaaaacatg tgtgggtgat 3600 
ttggctgttg taattgccaa agcattggat gagtttaaag agttctgcat tgtaaacgct 3660 
gcaaatgagc acatgtcgca tggtggtggc gttgcaaagg caattgcaga cttttgtgga 3720 
ccggactttg ttgaatattg cgcggactat gttaagaaac atggtccaca gcaaaaactt 3780 
gtcacacctt catttgttaa aggcattcaa tgtgtgaata atgttgtagg acctcgccat 3840 
ggagacagca acttgcgtga gaagcttgtt gctgcttaca agagtgttct tgtaggtgga 3900 
gtggttaact atgttgtgcc agttctctca tcagggattt ttggtgtaga ttttaaaata 3960 
tcaatagatg ctatgcgcga agcttttaaa ggttgtgcca tacgcgttct tttattttct 4020 
ctgagtcaag aacacatcga ttatttcgat gcaacttgta agcagaagac aatttatctt 4080 
acggaggatg gtgttaaata ccgctctgtt gttttaaaac ctggtgattc tttgggtcaa 4140 
tttggacagg tttttgcaag aaataaggta gtcttttcgg ctgatgatgt tgaggataaa 4200 
gaaatcctct ttatacccac aactgacaag actattcttg aatattatgg tttagatgcg 4260 
caaaagtatg taacatattt gcaaacgctt gcgcagaaat gggatgttca atatagagac 4320 
aattttgtta tattagagtg gcgtgacgga aattgctgga ttagttcagc aatagttctc 4380 
cttcaagctg ctaaaattag atttaaaggt tttcttgcag aagcatgggc taaactgttg 4440 
ggtggagatc ctacagactt tgttgcctgg tgttatgcaa gttgcaatgc taaagtaggt 4500 
gatttttcag atgctaattg gcttttggcc aatttagcag aacattttga cgcagattac 4560 
acaaatgcac ttcttaagaa gtgtgtgtcg tgcaattgtg gtgttaagag ttatgaactt 4620 
aggggtcttg aagcctgtat tcagccagtt cgagcaccta atcttctaca ttttaaaacg 4680 
caatattcaa attgcccaac ctgtggtgca agtagtacgg atgaagtaat agaagcttca 4740 
ttaccgtact tattgctttt tgctactgat ggtcctgcta cagttgattg tgatgaaaat 4800 
gctgtaggga ctgttgtttt cattggctct actaatagtg gccattgtta tacacaagcc 4860 
gatggtaagg cttttgacaa tcttgctaag gatagaaaat ttggaaggaa gtcgccttac 4920 
attacagcaa tgtatacacg tttttctctt aggagtgaaa atcccctact tgttgttgaa 4980 
catagtaagg gtaaagctaa agtagtaaaa gaagatgttt ctaaccttgc tactagttct 5040 


aaagccagtt ttgacgatct tactgacttt gaacagtggt atgatagcaa catctatgag 5100 
agtcttaaag tgcaggagac acctgataat cttgatgaat atgtgtcatt tacgacaaag 5160 
gaagattcta agttgccact gacacttaaa gttagaggta tcaaatcagt tgttgacttt 5220 
aggtctaagg atggttttac ttataagtta acacctgata ctgatgaaaa ttcaaaaaca 5280 
ccagtctact acccagtctt ggattctatt agtcttaggg caatatgggt tgaaggcagt 5340 
gctaattttg ttgttgggca tccaaattat tatagtaagt ctctccgaat tcccacgttt 5400 
tgggaaaatg ccgagagctt tgttaaaatg ggttataaaa ttgatggtgt aactatgggc 5460 
ctttggcgtg cagaacacct taataaacct aatttggaga gaatttttaa cattgctaag 5520 
aaagctattg ttggatctag tgttgttact acgcagtgtg gtaaaatact agttaaagca 5580 
gctacatacg ttgccgataa agtaggtgat ggtgtagttc gcaatattac agatagaatt 5640 
aagggtcttt gtggattcac acgtggccat tttgaaaaga aaatgtccct acaatttcta 5700 
aagacacttg tgttcttttt cttttatttc ttaaaggcta gtgctaagag tttagtttct 5760 
agctataaga ttgtgttatg taaggtggtg tttgctacct tacttatagt gtggtttata 5820 
tacacaagta atccagtagt gtttactgga atacgtgtgc tagacttcct atttgaaggt 5880 
tctttatgtg gtccttataa tgactacggt aaagattctt ttgatgtgtt acgctattgt 5940 
gcaggtgatt ttacttgtcg tgtgtgttta catgatagag attcacttca tctgtacaaa 6000 
catgcttata gcgtagaaca aatttataag gatgcagctt ctggcattaa ctttaattgg 6060 
aattggcttt atttggtctt tctaatatta tttgttaagc cagtggcagg ttttgttatt 6120 
atttgttatt gtgttaagta tttggtattg agttcaactg tgttgcaaac tggtgtaggt 6180 
tttctagatt ggtttgtaaa aacagttttt acccatttta attttatggg agcgggattt 6240 
tatttctggc tcttttacaa gatatacgta caagtgcatc atatattgta ctgtaaggat 6300 
gtaacatgtg aagtgtgcaa gagagttgca сфсадсааса ggcaagaggt tagcgttgta 6360 
gttggtggac gcaagcaaat agtgcatgtt tacactaatt ctggctataa cttttgtaag 6420 
agacataatt ggtattgtag aaattgtgat gattatggtc accaaaatac atttatgtcc 6480 
cctgaagttg ctggcgagct ttctgaaaag cttaagcgcc atgttaaacc tacagcatat 6540 
gcttaccacg ttgtgtatga ggcatgcgtg gttgatgatt ttgttaattt aaaatataag 6600 
gctgcaattc ctggtaagga taatgcatct tctgctgtta agtgtttcag tgttacagat 6660 
tttttaaaga aagctgtttt tcttaaggag gcattgaaat gtgaacaaat atctaatgat 6720 
ggttttatag tgtgtaatac acagagtgcg catgcactag aggaagcaaa gaatgcagcc 6780 
gtctattatg cgcaatatct gtgtaagcca atacttatac ttgaccaggc actttatgag 6840 
caattaatag tagagcctgt gtctaagagt gttatagata aagtgtgtag cattttgtct 6900 
aatataatat ctgtagatac tgcagcttta aattataagg caggcacact tcgtgatgct 6960 
ctgctttcta ttactaaaga сдаадааадсс gtagatatgg ctatcttctg ccacaatcat 7020 
gaagtggaat acactggtga cggttttact aatgtgatac cgtcatatgg tatggacact 7080 
gataagttga cacctcgtga tagagggttt ttgataaatg cagatgcttc tattgctaat 7140 
ttaagagtca aaaatgctcc tccggtagta tggaagtttt ctgatcttat taaattgtct 7200 
gacagttgcc ttaaatattt aatttcagct actgtcaagt caggaggtcg tttctttata 7260 
acaaagtctg gtgctaaaca agttatttct tgtcataccc agaaactgtt ggtagagaaa 7320 
aaggcaggtg gtgttattaa taacactttt aaatggttta tgagttgttt taaatggctt 7380 
tttgtctttt atatactttt tacagcatgt tgtttgggtt actactatat ggagatgaat 7440 


aaaagttttg ttcaccccat gtatgatgta aactccacac tgcatgttga agggttcaaa 7500 
gttatagaca aaggtgttat tagagagatt gtgtcagaag ataattgttt ctctaataag 7560 
tttgttaatt ttgacgcctt ttggggtaaa tcatatgaaa ataataaaaa ctgtccaatt 7620 
gttacagttg ttatagatgg tgacgggaca gtagctgttg gtgttcctgg ttttgtatca 7680 
tgggttatgg atggtgttat gtttgtgcat atgacacaga ctgatcgtag accttggtac 7740 
attcctacct ggtttaatag agaaattgtt ggttacactc aggattcaat tatcactgag 7800 
ggtagttttt atacatctat agcattattt tctgctagat gtttatattt aacagccagc 7860 
aatacacctc aattgtattg ttttaatggc gacaatgatg cacctggagc cttaccattt 7920 
ggtagtatta ttcctcatag agtatacttc caacctaatg gtgttaggct tatagttcca 7980 
caacaaatac tgcatacacc ctacatagtg aagtttgttt cagacagcta ttgtagaggt 8040 
agtgtatgtg agtatactaa accaggttac tgtgtgtcac tagactccca atgggttttg 8100 
tttaatgatg aatacattag taaacctggc gttttctgtg gttctactgt tagagaactt 8160 
atgtttaata tggttagtac attctttact ggtgtcaacc ctaatattta tattcagcta 8220 
gcaactatgt ttttaatact agttgttatt gtgttaattt ttgcaatggt tataaagttt 8280 
caaggtgttt ttaaagctta tgcgaccatt gtgtttacaa taatgttagt ttgggttatt 8340 
aatgcatttg ttttgtgtgt acatagttat aatagtgttt tagctgttat attattagta 8400 
ctctattgct atgcatcatt ggttacaagt cgcaatactg ctataataat gcattgttgg 8460 
cttgttttta cctttggttt aatagtaccc acatggttgg cttgttgcta tctgggattt 8520 
attctttata tgtacacacc gttggttttc tggtgttacg gtactactaa aaatactcgt 8580 
aagttgtatg atggcaacga gtttgttggt aattatgacc ttgctgcgaa gagcactttt 8640 
gttattcgtg gtactgaatt tgttaagctt acgaatgaga taggtgataa atttgaagcc 8700 
tatctttctg cgtatgctag acttaaatac tattcaggca ctggtagtga gcaagattac 8760 
ttgcaagctt gtcgtgcatg gttagcttat gctttggacc aatatagaaa tagtggtgtt 8820 
gaggttgttt ataccccacc gcgttactct attggtgtta gtagactaca cgctggtttt 8880 
aaaaaactag tttctcctag tagtgctgtt gagaagtgca ttgttagtgt ctcttataga 8940 
ggcaataatc ttaatggact gtggctgggt gattctattt actgcccacg ccatgtgtta 9000 
ggtaagttta gtggtgacca gtggggtgac gtactaaacc ttgctaataa tcatgagttt 9060 
gaagttgtaa ctcaaaatgg tgttactttg aatgttgtca gcaggcggct taaaggagca 9120 
gttttaattt tacaaactgc agttgccaat gctgaaactc ctaagtataa gtttgttaaa 9180 
gctaattgtg gtgatagttt cactatagct tgttcttatg gtggtacagt tataggactt 9240 
taccctgtca ctatgcgttc taatggtact attagagcat ctttcctagc aggagcctgt 9300 
ggctcagttg gttttaatat agaaaagggt gtagttaatt tcttttatat gcaccatctt 9360 
gagttaccta atgcattaca cactggaact gacctaatgg gtgagtttta tggtggttat 9420 
gtagatgaag aggttgcgca aagagtgcca ccagataatc tagttactaa caatattgta 9480 
gcatggctct atgcggcaat tattagtgtt aaagaaagta gtttttcaca acctaaatgg 9540 
ttggagagta ctactgtttc tattgaagat tacaataggt gggctagtga taatggtttt 9600 
actccatttt ccactagtac tgctattact aaattaagtg ctataactgg ggttgatgtt 9660 
tgtaaactcc ttcgcactat tatggtaaaa agtgctcaat ggggtagtga tcccatttta 9720 
ggacaatata attttgaaga cgaattgaca ccagaatctg tatttaatca agttggtggt 9780 


gttaggttac agtcttcttt tgtaagaaaa gctacatctt ggttttggag tagatgtgta 9840 
ttagcttgct tcttgtttgt gttgtgtgct attgtcttat ttacggcagt gccacttaag 9900 
ttttatgtac atgcagctgt tattttgttg atggctgtgc tctttatttc ttttactgtt 9960 
aaacatgtta tggcatacat ggacactttc ctattgccta cattgattac agttattatt 10020 
ggagtttgtg ctgaagtccc tttcatatac aatactctaa ttagtcaagt tgttattttc 10080 
ttaagccaat ggtatgatcc tgtagtcttt gatactatgg taccatggat gttattgcca 10140 
ttagtgttgt acactgcttt taagtgtgta caaggctgct atatgaattc tttcaatact 10200 
tctttgttaa tgctgtatca gtttatgaag ttaggttttg ttatttacac ctcttcaaac 10260 
actcttactg catatacaga aggtaattgg gagttattct ttgagttggt tcacactatt 10320 
gtgttggcta atgttagtag taattcctta attggtttaa ttgtttttaa gtgtgctaag 10380 
tggattttat attattgcaa tgcaacatac tttaataatt atgtgttaat ggcagtcatg 10440 
gttaatggca taggctggct ttgcacctgt tactttggat tgtattggtg ggttaataaa 10500 
gtttttggtt taaccttagg taaatacaat tttaaagttt cagtagatca atataggtat 10560 
atgtgtttgc ataaggtaaa tccacctaaa actgtgtggg aggtctttac tacaaatata 10620 
cttatacaag gaattggagg cgatcgtgtg ttgcctatag ctacagtgca atctaaattg 10680 
agtgatgtaa agtgtacaac tgttgtttta atgcagcttt tgactaagct taatgttgaa 10740 
gcaaattcaa aaatgcatgc ttatcttgtt gagttacaca ataaaatcct cgcatctgat 10800 
gatgttggag agtgcatgga taatttattg ggtatgctta taacactatt ttgtatagat 10860 
tctactattg atttgggtga gtattgtgat gatatactta agaggtcaac tgtattacaa 10920 
tcggttactc aagagttttc gcacataccc tcgtatgctg aatatgaaag agctaagagt 10980 
atttatgaaa aggttttagc cgattctaaa aatggtggtg taacacagca agagcttgct 11040 
gcatatcgta aagctgccaa tattgcaaag tcagtttttg atagagactt ggctgttcaa 11100 
aagaagttag atagcatggc agaacgtgct atgacaacaa tgtataaaga ggcgcgtgta 11160 
actgatagaa gagcaaaatt agtttcatca ttacatgcac tacttttttc aatgcttaag 11220 
aaaatagatt ctgagaagct taatgtctta tttgaccagg cgaatagtgg tgttgtaccc 11280 
ctagcaactg ttccaattgt ttgtagtaat aagcttaccc ttgttatacc agacccagag 11340 
acgtgggtca agtgtgtgga gggtgtgcat gttacatatt caacagttgt ttggaatata 11400 
gactgtgtta ctgatgccga tggcacagag ttacacccca cttctacagg tagtggattg 11460 
acttactgta taagtggtga taatatagca tggcctttaa aggttaactt gactaggaat 11520 
gggcataata aggttgatgt tgccttgcaa aataatgagc ttatgcctca cggtgtaaag 11580 
acaaaggctt gcgtagcagg tgtagatcaa gcacattgta gcgttgagtc taaatgttat 11640 
tatacaagta ttagtggcag ttcagttgta gctgctatta cctcttcaaa tcctaatctg 11700 
aaagtagcct cttttttgaa tgaggcaggt aatcagattt atgtagactt agacccacca 11760 
tgtaaatttg gtatgaaagt gggtgataag gttgaagttg tttacctgta ttttataaaa 11820 
aatacgaggt ctattgtaag aggtatggta cttggtgcta tatctaatgt tgttgtgtta 11880 
caatctaaag gtcatgagac agaggaagtg gatgctgtag gcattctctc actttgttct 11940 
tttgcagtag atcctgcgga tacatattgt aaatatgtgg cagcaggtaa tcaaccttta 12000 
ggtaactgtg ttaaaatgtt gacagtacat aatggtagtg gttttgcaat aacatcaaag 12060 
ccaagtccaa ctccggatca ggattcttat ggaggagctt ctgtgtgtct ttattgtaga 12120 
gcacatatag cacaccctgg cggagcagga aatttagatg gacgctgtca atttaaaggt 12180 


62 


tcttttgtgc 
gtttgcactg 
cctaaacctt 
gggtacgggg 
ttgtaaagcg 
agcgtaactg 
gtgattctta 
gttatgaaga 
acatttataa 
ctttgcggca 
gttgtataga 
tagaaaaccc 
tattgaatgc 
cacttgataa 
cgcctggtgc 
ccatgactga 
aatcttatga 
actttaagta 
gtttgataca 
tcggtaattt 
atcattctaa 
tgggtttgag 
ataaattagt 
ctcatcaaac 
ctggtatgtt 
gtaatgctgc 
tacgtcaact 
gctgtatacc 
tcaataagtt 
tctttgagag 
ccatatccge 
ctaataggca 
tagttattgg 
agggtgttga 
ctaatttgtt 
cttggtctga 
tcttagctac 
ctgcttatgc 


ttttgagtgt 


aaatacctac 
tttgtcagtg 
ctgttcagtc 
tagcagtgag 
agcctttgat 
tgcacgattc 
ttttgtggtt 
cttaaagtca 
tattagtagg 
ctttgaccca 
agattatcac 
taaatattat 
tattgagttc 
ccaagatctt 
tggtgttcct 
tgcgttggca 
tctcctcaag 
ttgggatcaa 
ttgtgcaaac 
gtgtagaaag 
ggaacttggt 
tcaactcatg 
ggatcttaga 
ggtaaaacca 
taaggaaggt 
tataaacgat 
tttattttgt 
agcaagccaa 
tggaaaggcc 
tacaaagaag 
gaaaaataga 
gtttcatcag 
aacaaccaag 
agacccgatt 
gcgtatagca 
acgcgtttat 
aggtggtata 
aaacagtgtt 


tataacgcgt




tacggagaaa 
ttggattggt 
agttgctgtt 
gctcggctga 
gtttgtaata 
caagaagtac 
aaacaaacca 
gaagtaacag 
cagaggctta 
aaggattgcg 
cctaagtggt 
gccatgttgg 
ggaaacctca 
aatggcaaat 
gtttttgata 
cctgagaggt 
tatgattata 
gagtatcacc 
ttcaacatct 
gtttttgttg 
gttattatga 
cagtttgttg 
acgtcttgtt 
ggtcacttta 
tcttctatac 
tatgattatt 
ttagaagtga 
gttgtagtta 
cgtctctatt 
aacgtcctgc 
gcgcgtacag
aagattctta 
ttttatggcg 
cttatgggtt 
gcatctttag 
aggttgtata 
tatgtgaaac 


ttcaacataa 


gatattgtat
gatcctgttg
tatggatgtc 
gcatctggtt 
tacccctagc 
aggaatcagc 
gtgatactga 
ctcctagtaa 
ctgatcatga 
ctaagtatac 
aagttcttaa 
ttgaagagaa 
ctaaaatggg 
tggttgaaaa 
tttatgattt 
cgtattattc 
attttgaata 
ctgaggagaa 
ctaactgtcg 
tgttttctac 
atggtgtacc 
atcaagataa 
gagatcctgc 
ttagtgtttg 
acaaggattt 
cacttaaaca 
atcgttataa 
cttctaaata 
acaatttaga 
atgaaatgag 
ctactataac 
tggcaggtgt 
agtctatagt 
gttgggataa 
gggattatcc 
tactcgctcg 
atgaatgcgc 
ctggtggtac 
tacaagccac 


atgatgacat 




gattctgtct 
agtgtgattc 
ttgataagaa 
taatggatgt 
cggtatgttt 
agatggaaat 
ttatgaacat 
tttctttgtg 
tatgatggat 
agaaatactt 
taaggattgg 
acctattgta 
aggttatgtt 
tggtgatttt 
ttacatgatg 
tgatgtgcat 
acaagatttg 
cgactgtagt 
acttgtaccg 
atttatagct 
caccatgtca 
cttgttagtg 
tgctttagcg 
ctacgatttt 
tttcttctac 
caggcctacc 
ttttgaatgt 
taagagtgca 
tctagaggag 
tcagatgaat 
gtctatcctt 
caacactaga 
catgttgaga 
aaagtgtgat 
taaacacact 
tcaggtttta 
tagcagtgga 
atctgctaat 


taagagcttg 


acgtaacaag 
acttagacaa 
ttatttaaac 
дассссдаїд 
caaaatttga 
cttgagtatt 
gagaaagctt 
ttcaataaga 
ttttgctatg 
gtcacttatg 
tacgacccaa 
cgacgtgctt 
ggtgttatta 
cagaagacag 
cccatcatag 
aagggttata 
tttcagaagt 
gatgacaggt 
cagacttctt 
acttgtggct 
ttttcaaaaa 
gggacatcca 
tctggtatta 
gcagagaagg 
ccacagactg 
atgtttgata 
tatgaaggcg 
ggttatccgt 
caggaccaac 
ttaaaatatg 
tctactatga 
aacgctcctg 
aaccttattc 
agagcaatgc 
aattgttgta 
tctgaaactg 
gatgctacta 
gttgcgcgtc


cagtatgaat 


12240 


12300 


12360 


12420 


12480 


12540 


12600 


12660 


12720 


12780 


12840 


12900 


12960 


13020 


13080 


13140 


13200 


13260 


13320 


13380 


13440 


13500 


13560 


13620 


13680 


13740 


13800 


13860 


13920 


13980 


14040 


14100 


14160 


14220 


14280 


14340 


14400 


14460 


14520 




tgtaccagca 
cttatttgtg 
acaacacatt 
actatcagaa 
aaggcccaca 
gatacttgcc 
tggataagac 
acccactagt 
acatcagaaa 
tggatataga 
cccctacaac
gctgtggtaa 
tgcacacaga 
gtggtgaagc
ataaaccaaa 
ctaattgtgc 
ctactgtgga 
cagagacagt 
gagaagtact 
cattgaatag 
tcggtgattt 
ctactgctaa 
ttatagcgcc 
atgtgatggt 
agaagcgtac 
tggcggctta
atgctttatg 
ctcaaaggac 
acatttttag 
aggttagtat 
atgttgtgta 
cactctctcc
ttttccttgc 
tatatgatgg 
ttaataatgg 
tagaatttgt 
caccttataa 
tagactcgtc 
agcatgcact 


tagttgtcat 


ggtttatagg 
taagaatttc 
agccaaacaa 
caatgttttt 
tgaattttgt 
atatccagac 
agaatctgtg 
acatcatgaa 
actctatcaa 
taagggtagt 
attacagtct 
ttgtattcgc 
ccacaaaaat 
agatgttact 
gttatcaata 
aggtagcgaa 
accttatatt 
aaaagctaca 
ctcagatcgt 
aaattatgtt 
tacatttgaa 
attgtctgtt 
aacgttgtgt 
acctgcgtgt
tacagtacaa 
ctttagtaac 
tgaaaaagct 
tactatcgat 
tactattaat 
gttgaccaat 
tgtaggtgat 
aaaggattat 
aaagtgttac 
aaagtttatt 
taattctgat
gaaagatttt 
tgctatgaac 
tcaaggttcg


gaatattaac 


gcgtcagcgt




cgagtcaatt 
tcattgatga 
ggtcttgtag 
atggctgatt 
tcacagcaca 
ccatcacgta 
gctgttatgg
aatgaggagt 
gagctttctc 
aaattttggg 
tgtggcgttt
aaaccatttt 
gttttgtcta 
aaattgtacc 
ccgttagtat 
aatgttgatg 
ttggcaaatc 
gaagaattac 
gaattgattc 
ttcactggct 
aaaggtgaag 
ggagacattt 
cctcagcaaa 
tttgtaaata 
ggccctcctg
gcccgtgtcg
tttaagtttc
tgcttctcta
gccttgccag
tacgaattgt 
cctgctcaat 
aatgttgtca 
cgttgtccta 
gcaaataacc 
gtaggacatg 
gtctgtcgca 
cagagagcct 
gagtatgatt 
agattcaatg 


gatgaactat
ttgacccagc
tcttgtctga 
cagatatttc 
ctaaatgttg 
caatgttagt 
ttttgtgtgc 
agcgttatat 
acaagaaggt 
agaatatgct 
aacaggagtt 
gtgtagtgtg 
tgtgttgtaa 
taaatcctta 
tcggaggtat 
ctaatggtac 
attttaatca 
gttgtgtaga 
ataagcaaca 
tgtcttggga 
ttcactttac 
gtaaggacgt 
ttgttttaac 
ccttttctag 
acattccatt 
gcagtggtaa 
tttttactgc 
ttaaagtaga 
agtttaaagc 
aagttagttg 
cttttattaa 
taccggcgcc
caaaccttat 
aagaaattgt 
cggaatcacg 
aaagtggctc 
ataaggaatg 
accgtatgct 
atgttatctt 
tagcgcttac 


attcagctct 




atttgttgaa 
cgacggtgtt 
tggttttaga 
ggttgaacca 
ggaggttgat 
atgtgttttt 
cgctcttgcc
attctttgtg 
tatggactac 
ctatgaaaat 
taatagtcaa 
gtgttgctat 
catttgctca 
gtcatacttc 
agtgtttgga 
actagctact 
ttcgttgaga 
atttgctagt 
gccaggtaaa 
tagaactagt 
tgtctattat 
ctcacacaat 
gtttgtgaat 
gtaccattta 
atcccatttt 
atgctctcat 
tgattgcact 
taatgacaca 
tgacattctt 
tggtaagata 
tcgtacgttg 
ggtttgtgtt 
agatactgtt 
tcagtgtttc 
agcctacaac 
gcgggaagca
tggacttaat 
ttgtgttact 
aagagccaag 


taagtttata 


aagttttatt 
gtttgttata 
gaagttctct 
gatttagaaa 
ggtgagccta 
gtagatgatt 
atagatgcgt 
cttctttcat 
tcttttgtaa 
atgtatagag 
actatattgc 
gaccatgtca 
cagccaggtt 
tgcggtaatc 
atttacaggg 
actaattggt 
cgctttgctg 
gcagaagtga 
accaggcctc 
aaagttcagc 
cgagcgacgt
gttgtttctc 
ttaagaccta 
gtaggcaagc 
gctataggat 
gcagctgttg 
cgtatagtac 
ggcaaaaagt 
ttggttgacg 
aactatcaat 
cttaacggtt 
aaacctgaca 
tctactcttg 
aaggttatag 
ataactcaat 
acattcattt 
gttcagacag 
gcagattcgc 
cgtggtatac 


gagcttgata 


14580 


14640 


14700 


14760 


14820 


14880 


14940 


15000 


15060 


15120 


15180 


15240 


15300 


15360 


15420 


15480 


15540 


15600 


15660 


15720 


15780 


15840 


15900 


15960 


16020 


16080 


16140 


16200 


16260 


16320 


16380 


16440 


16500 


16560 


16620 


16680 


16740 


16800 


16860 


16920
gtgtagcaag
ttcacccagc 
aacttgctgc 
ctttgttagg 
gtgatgaggc 
cttgcggtac
actttgtagt 
tgaattctaa 
ctaaaccttg 
acgtttcaga 
gctattttgt 
ttaattctca 
ataatccact 
atgatttgca 
cgcgttgtct
ctcatatagc 
ttaatgcatg 
gtataaaatg 
tacccaatgt 
gtctttgtat 
ggtacgacac 
atgttaacaa 
aagctatgcc 
ttgcgcaaga 
gtgctgtttg 
ctgttactgc 
aaagtttttc 
attatgatgc 
tagatcaagg 
cgtttgagct 
gtttgggtgt 
tataccgtaa 
tgctgtatga 
tagtttctac 
ttcagaacgg 
gtgcgtttgt
aacctcgtag 
agtatggtaa 


taggtggttt 


tctgcaaggt 
ttatgcagtc 
acttgttaac 
gtttaagatg 
tatccgcaac 
taacattggt 
tacgcctgag
agcacctcca 
gcatgttgta 
ttgtgtagtg 
taaaataggc 
tactcaggct 
cttagtggat 
ttgtaatgtg 
tgcaattaat 
aaatgaggat 
tgttgatgct 
tgttagacgt 
caagcagttt 
gttttggaat 
acgaaatttg 
gcatgcattc 
attctttttc 
ccttgtgtca 
taaaaagcac 
tggttttact 
agctctccag 
tattgcagga 
cgtagaaaaa 
gtatgcgaag 
agatgtgact 
tactgttaag 
tgatagatat 
acagtgttac 
tattccgtta
tacgctacct 
tgatgttgag 
agaattaggt 


acacactgtt
acaggcttgt
acaactaagg 
gtggaagctg 
agtgttaatg 
gtaagaggtt 
actaacctgc 
ggacttgtag 
ggtgaacaat 
aggccaagga 
tttgtcacgt 
aaggaccaag 
tatgcttgtt 
attcaacagt 
catggacacg 
aatgcatttt 
gaagtcaatt 
cttaaagtta 
ggagacttaa 
gagtatgact 
tgtaatgtgg 
agtgtgttta 
cacacaccta 
tatgactcat 
ttagctacga 
gcacaaatgt 
ttttgggtta 
tctatcgaca 
gaaatgccca 
gcagtttttt 
agaaatattc 
aatggatttg 
gtatgtgcat 
ggtgattacc 
aagcggtatt 
aaagatggag 
aacacattaa 
cgtgattttc 
ctacagcaca 


ataggtatgt
ttaaaatttg
ctcttgctgc 
gttcagaaat 
ttgaaggctg 
gggtaggttt 
ctttccaagt 
atacttcaat 
ttaatcactt 
ttgtgcaaat 
ggtgtcatgg 
tttgttcttg 
ggaagcattg 
ggggttattc 
cacatgtagc 
gtcaagatgt 
ctagctgtag 
acgttgtcta 
attttagatt 
ataatcagca 
attgttatcc 
acctacctgg 
aatttgatcg 
cgccttgcga 
aagattgtat 
atgcagattt 
ctaataattt 
atattgctta 
ctatcgtaac 
ttaatcaaac 
gcacactgcc 
taatttggga 
atacagacat 
agtcttttct 
cgtatgtaga 
cgaacctgta 
acacacaggg 
tcgacatgtc 
tactgtatgg 


gcagactttt 




caacaaagag 
aacttataaa 
aacatataaa 
ccacaacatg 
tgatgtagaa 
aggtttctct 
aggcaataat 
gagagcgtta 
gttagcggat 
cctagaacta 
cggttctaga 
cttgggtttt 
tggtaaccta 
ttctgcggat 
caactgggat 
atatttacaa 
tgatataggc 
ctatgataag 
caaagataag 
cgacaattcc 
ttgtaatggt 
cactagcttt 
gaccattcaa 
cacaaaatgc 
tgtgacttct 
taacccatat 
taatatgtat 
tggagataaa 
aattctgcct 
aaacaaccgt 
ttacacgaac 
agaaccaaat 
agctgctgat 
aataccgtca 
tgtttataag 
tcgcagttat 
tgaggagagt 
tgaagttgat 


acgtgcgaat 


tttagtggtg 
gttaatgatg 
catcttattt 
tttataacac 
gcaacacatg 
actggtgcag 
tttgagcctg 
ttcaaaagtg 
aacctgtgca 
accactttgc 
gcaacaactt 
gattttgttt 
caatttaacc 
gctattatga 
ttaacttacc 
cgcatgtatc 
aaccctaaag 
aatccaatag 
tttgctgatg 
ttagtttgta 
ggtagcttgt 
cgtaatttga 
ttggatggag 
aacataggcg 
tataatgcag 
aatttgtgga 
aagggtggtc 
gtttttgtta 
acatctgtag 
attttgaaag 
caaacaccac 
ggcctaatag 
aatgctgttt 
aacctgcttg 
cgtgttaatg 
gaaacttttg 
tttgtagaaa 
aagccccaat 


aagttgaacg 


16980 


17040 


17100 


17160 


17220 


17280 


17340 


17400 


17460 


17520 


17580 


17640 


17700 


17760 


17820 


17880 


17940 


18000 


18060 


18120 


18180 


18240 


18300 


18360 


18420 


18480 


18540 


18600 


18660 


18720 


18780 


18840 


18900 


18960 


19020 


19080 


19140 


19200 


19260
caaagtctgt
atggttccta 
ttcttaggaa 
ttgattacca 
atccacagct 
agaattgtgt 
gtggtattat 
caatgtgtgt 
tggctccagg 
atgatattgt 
ataagacaga 
aaagaaagca 
caagttttct 
caagttggca 
gtacagcagt 
caagtgaaaa 
attgtaatta 
gattgaaagc 
atttaattaa 
actcttttgt 
taccaaagtg 
gttaatattt 
attcatggtg 
atggcttggt 
tttgttacac 
tttttacgtg 
gctaagtacc 
aatggtgatc 
tttaaagctg 
tttgttaatg 
gcatgccagt 
ttagttaagc 
cacaatttca 
attcaaactt 
ctgagtagtt 
aattttagac 
gcttacggtc 
tgttatgctt 
gatcttaatt 


caaacagcca 


tactaattct 
caagcaagtg 
catactgaaa 
tagcataaat 
tcaatcagca 
tatggaacct 
gatgaatgtg 
accgcataat 
tagtactgtt 
agactatgtg 
gcacaagttt 
tgaaggcgtg 
tcgtaataat
cgaagtttta 
gaatgcctct 
ggttaaggtt 
tttacaaacc 
aacaccagtt 
gtgtggtaag 
gtgtactatg 
cctttagacc 
ctagcgaatc 
gtcgtgttgt 
ctagcagtca 
attgttataa 
tttctgctat 
ctacttttaa 
ttgtttacac 
gtggacctat 
gtactgcaca 
ataatactgg 
agaagtttat 
cttttcataa 
accaaacaca 
ttgtttataa 
tagaaactat 
ctcttcaagg 
attcatatgg 
ttgaatgtgg 


ctgaaccgcc
gattctgatg
tgtactgttg 
gagtatggta 
tttatgactt 
tggacgtgtg 
tgcaacattc 
gcaaagtata 
atgcgagtaa
cttaaacaat 
tctgatgcac 
gatcttgtga 
atagccaata 
ttggctctag 
tatgacattg 
tcttcagaag 
agtggaaaaa 
tctgcttata 
gttaatttga 
ttactggtaa 
tagtgctgct 
acctaatggt 
taataatgca 
taatgcttct 
gttttgtact 
atatgatggg 
gaaaaatggc 
atcatttcag 
ctctaatgag 
aacttataaa 
agatgttatt 
caatttttca 
tgtctatcgt 
tgagactggc 
aacagctcag 
ggagtctaat 
taataatggc 
tggttgcaag 
aggtccttcg 
actgttagtt 


agttataact
tcatgcaaaa
tggatttgct 
ctaataagtc 
ggtttgaaga 
gttataatat 
ctaattatgg 
cacaactctg 
tgcattttgg 
ggctcccaga 
atgtttctgt 
tatctgatat 
atggcaatga 
gtggtagttt 
cacaggattg 
cattcttggt 
cgctgcacgc 
gtatatttga 
aaactgaaca 
gagatgttgg 
ttgtatgaca 
tggcatttac 
ggctcttcac 
tctatagcta 
gcacactgta 
tgtcctataa 
cagcttttct 
tgtgttaata 
accacagatg 
gttatgagag 
ttgtgtgatg 
gatggctttt 
gaaaatagtg 
gccaacccta 
agtggttatt 
tttatgtatg 
ttgtggttta 
caatctgtct 
ctgtgtaaag 
tatgttacta 


cgacacaatt 




ttattttgta 
gcttgatgat 
taaagttgta 
tggcattatt 
gcctgaactt
tgttggaata 
tcaatacctt 
agctggaagt 
agggacactc 
gctttcagat 
gtatacagac 
tgacgttttc 
tgctgtaaaa 
tgcatggtgg 
tggtgttaat 
aaattatata 
cgttgctaag 
aaagacagac 
taacacctct 
gtagttctta 
acgggggtgc 
ctgggtgtat 
tgacggcacc 
acttttcaga 
ctggcatgct 
ataatttaac 
atttaacatc 
ttacatctgc 
aagttaaagc 
gatcacctag 
atccttttat 
ttaatactac 
atcctagtgg 
ataattttaa 
gatcttatca 
attcactttc 
ttagtggtag 
gtgtttattc 
agagcggtgg 


ataataatat 


ttggcagaca 
ttcttagaac 
acagtgtcaa 
aaaacatgtt 
tataaagttc 
gcgttgccaa
tcgaaaacaa 
gacaaaggag 
cttgtcgata 
tgcaataaat 
aatgattcaa 
atatatctct 
gtgacagaga 
acaatgtttt 
tatttgggtg 
ttttggagga 
tttgatttga 
ttagtcttta 
tttactagtg 
cgtttactac
ttatgcggta
tgttggtact 
gtcatcaggt 
tactacagtg 
tcaaaagaat 
agttagtgta 
cgtatattta
aggtgtttat 
cctggcttat 
aggcttgtta 
taatagtagt 
ttttacgtta 
tgttcagaat 
tttttccttt 
cccaagttgt 
agtttcaatt 
agcaacttgt 
aggtgagtta 
ctctcgtata 


tactttaaat 


19320 


19380 


19440 


19500 


19560 


19620 


19680 


19740 


19800 


19860 


19920 


19980 


20040 


20100 


20160 


20220 


20280 


20340 


20400 


20460 


20520 


20580 


20640 


20700 


20760 


20820 


20880 


20940 


21000 


21060 


21120 


21180 


21240 


21300 


21360 


21420 


21480 


21540 


21600 


21660 




acttgtgttg 
gactcagctg 
ggttccatag 
ccttgcgaag 
acttcacgta 
aatggaacac 
agttatggta 
ttggaacagt 
tttaatttaa 
tgtctgcagt 
cctgtttgtg 
cttttgaatt 
gttagcactg 
cgttctttta 
gacgcataca 
cgtgaatata 
tatactagtt 
ccttttgcca 
ttgaagaatc 
ggttttagaa 
gctattctta 
attcaagaaa 
ataactggta 
agagtgtcac 
tctattaggt 
cctaatggta 
gcaatagtgg 
aatggtaggg 
tatatgccaa 
tatgtaagtg 
aatgacgaat 
ttcaattaca 
atacagggtc 
attaagtggc 
atactaggat 
attatgcctc 
gatgtggtaa 
ccttcctaat


ttattatagc


attataatat 
ttagttataa 
acatctttgt 
atgtcaacca 
atgagactgg 
gtcgttttag 
agttttgtat 
ttgtggcacc 
ctgttacaga 
atgtttgtgg 
acaacatatt 
tctattcttc 
gtgagtttaa 
ttgaagacct 
aaaattgcac 
atggtttgct 
ctctagtagc 
cacaactgca 
aagaaaaaat 
gtacatctct 
ctgagactat 
tctaccagca 
gattgtcatc 
aacagcgtga 
actccttttg 
tagtgtttat 
gtttttgtgt 
gtatttttat 
gagctattac 
taaataagac 
tgtcaaaatg 
cagtacctat 
ttaatgactc 
cttggtatgt 
gggttttctt
taatgagtaa 
cttaacaata 
agtattaatt 


gctccaacaa
atatggcaga
ttatctagca 
tgtacaaggt 
gcagtttgta 
ttctcagctt 
acgttctatt 
aaaacctgat 
tttacttaat 
tgagtacata 
caattctctg 
gtctgtagta 
tactaaaccg 
tatttctctt 
tctatttaca 
tgcaggacct 
tgtgttgcct
ttctatggct 
ggctagaatt 
tgctgcttcc 
agcattacaa 
ggcatcactt 
acttgacgcc 
actttctgtt 
gttagctact 
tggtaatgga 
acacttttct 
aaagccagct 
acaagttaat 
tgcaggagat 
cgtcattact 
gtggaatgac 
acttgacatt 
tttaatagac 
gtggttagcc 
catgactgga 
gtgtggtaag 
cagacctaaa 
tttctttggt


ctaatacaag
actggccaag
gacgcaggtt 
gaatatggtc 
gtttctggtg 
cttgagaacc 
actgaaaatg 
ggttcaattg 
gttactgaaa 
caaacgcgta 
gattgtagag 
aatagtattg 
gctggtttta 
ctgttaacaa 
agcgttgaat 
ttaggttttc 
cccattataa 
tttggtggta 
aatcacttgg 
tttaataagg 
caaattcaag 
aataaaaatt 
atacaagcaa 
ttagcatctg 
cagaaaatta 
cgacatgttc 
tatactccag 
aatgctagtc 
ggtagttact 
atagttacgc 
acattcgtag 
actaagcatg 
gatagtgaaa 
cttgaaaaac 
atagcttttg 
tgttgtggtt 
aaatcttctt 
aagtctgttt 
gtaaacttgt 


ttttactcca 




gttttattac 
tggctatttt 
ttacttatta 
gtaaattagt 
agttttacat 
ttgcaaattg 
ccacaatagt 
atgtgctcat 
tggataaggt 
atttgtttca 
gtcaaaaaga 
atacaccatt 
ctcctagtag 
ctgttggatt 
ttaaggacct 
cagcagaaat 
ttactgcagc 
gtattaccca 
ccattggtcg 
atgttgttaa 
ttggtgctat 
atgctcaagt 
ctaagcaggc 
atgagtgtgt 
taaccatacc 
atagttttgt 
agtatgcaat 
acatcacagc 
ttacttcttg 
acaatgatga 
agctaccaga 
ttgatcgtat 
tttcaatact 
ccactattat 
gttgttgtgg 
attacacgac 
aatgattcaa 
actaagttgt 


aattatcaat 


taatgtaacc 
agatacatct 
taaggttaac 
aggtattctt 
taaaatcact 
cccttatgtt
accaaaacaa 
acctaacagt 
ccaaattaat 
acaatatggg 
agatatggaa 
tcttagtaat 
tcctagaagg 
accaacagat 
сасасасасе 
gcaaattttg 
tggtgctata 
gtcacttttg 
tatgcaggaa 
taagcagagt 
ttcttctatg 
ggatcgtctt
ggagcatatt 
taagtcacag 
gcaaaatgca 
taatgttact 
agtacccgct 
acgagatatg 
tcaagcaaat 
ttttgatttt 
ctttgacaaa 
tcaaggcgtt 
caaaacttat 
cttcatctta 
atgctttggc 
ttttgataac 
agtcccacgt 
tttagagagt 


agtaacttac 


21720 


21780 


21840 


21900 


21960 


22020 


22080 


22140 


22200 


22260 


22320 


22380 


22440 


22500 


22560 


22620 


22680 


22740 


22800 


22860 


22920 


22980 


23040 


23100 


23160 


23220 


23280 


23340 


23400 


23460 


23520 


23580 


23640 


23700 


23760 


23820 


23880 


23940 


24000
agcctagact
aactggtgag 
aaacacagaa 
aatagagtca 
aaaatggaag 
taggtagagc 
catgggtagt 
gaaaacttaa 
ggaataataa 
tttgaacagt 
ttcttaacca 
aaaatgatag 
atatacccac 
ctgtcttttg 
tggtcattta 
tgtaattttg 
ctttattgtg 
tttgtttgta 
сааадсадаа 
ggcgagctag 
agagagtatt 
atttttgagg 
tttaaaaaac 
tttcaagtag 
tgtaggttgt 
aggcgtttta 
taagggcagt 
ggttaatctt 
ttcaattaga 
tagtaaagat 
aggattagat 
tacctctcta 
gtcatggcaa 


ggaggaccaa 


aaagccaaga 


gaaaatctaa 


ggtaaaggtg 


ccagccgcta 


ggtgctgata 


tatccgctac 


gaccctttgt 
caagtgattc 
gtatttgacc 
gctgaagatt 
ttttctaaca 
acttcaagca 
aattccagga 
caatccggaa 
aaatccagca 
cagttgagct 
taatacttca 
tgttatggtg 
caaacacagg 
taggttattg 
acccagaatc 
ctatagagag 
agggtcagtg 
caccggatag 
ataagaaacg 
aaagtgtagc 
taaaattatt 
atattaatat 
agtttttcca 
ataatggaaa 
ggttgagtta 
tcttacaagc 
tatttcatgt 
agatcatgga 
tttagtttat 
aatccttttt 
tgtgtttact 
gtattccagg 
gcggtaaggc
agccacctaa 
agttaaattc 
aaccaagtca 
gaagaaaacc 
acctgaattg 
ctaaatttag 


ggttttcaga
cacagtctag
aaaaaatcag 
cctttgacta 
gttcaggtga 
gcgctttata
tttgtacagg 
gctaagggta 
ttagaagcag 
aattttcaag 
ttttaaagag 
gtatggctat 
cttttggccc 
aggtcttgtc 
gatccagagt 
taatgccgta 
tgtgccaatg 
gcttgctaag 
acgtaatatc 
gtttgctacg 
aacaggaggg 
ctttaatagt 
aaatcctctc 
ctcttttgtg 
agtctactac 
taaaaaagat 
gcttaataaa 
tataaacccc 
ccaaaacaca 
aggttggcgt 
gcggagcaat
ttcttaacaa 
ggaaaacttg 
aactggaaag 
agttggttct 
acctccgcct
gcagcatgga 
agtcccagat 
gggtgatagc 


atctaatcag 


cggaggacct
actaatgtta
tttcaattta 
ttgttattac 
tgatgaattt 
tatttgtagg 
ctgctgatgc 
cagcctttgt 
ttattgtcaa 
atgtccaacg 
tataatttat 
gcaacaagaa 
cttaacattg 
gcagcgataa 
attagactct 
ggttcaatac 
gtgctttctc 
tgtgaaccag 
taccgtatgg 
tttgtctatg 
agtagtcttt 
gcctctattt 
tgttttatac 
ccaaaaacta 
gaaggaaaac 
taaactacct 
tacggacgat 
tattattaac 
tcttaacgtg 
atacgcctac 
agcaagaaaa 
agcaggacaa 
tgaggaacac 
acagatgccc 
tctggaaatg 
aagtttgaag 
tattggagac 
gcttggtatt 


caagatggta 


ggtactcgtg 


gatggtaatt 




aacttagaag 
cagcatattt 
agaggaggta 
attgaataag 
atttttagca 
ttgttgttta 
atataagtat 
cgagtttcct 
agacaaattg 
ttataactgc 
gtaagtttat 
cagtaggtgt 
tacttacagt 
ttaagcggtg 
tcctaactaa 
caattataaa 
accacttgcc 
tgcagaaata 
caaagcagtc 
acacctaaat 
taagagcgca 
tctcttttca 
ttgttgttaa 
caatttttca 
actacactta 
gaaatggctg 
tcaattaaga 
tgttaggtgc 
tcaatcgctg 
gcgcgaattt 
gcagagtctt 
aaatataata 
cagctccagt 
tatcttggtt 
gtagcggtgt 
gccaagctag 
tttactatac 
tagtgtgggt 
actctgacaa 


tccgttggga 


caattattga 
caagtgtatt 
atttttggga 
tcgctagagg 
ctttatcttc 
ttttggtata 
acatatggta 
aagaacggtt 
tactcttgac 
attcttgttg 
ttatatactg 
aatttcatgt 
gtttgcgtgt 
taggtcatgg 
tggtcaacaa 
gaatggtgtt 
taaagatata 
tactggtgac 
agtagatact 
gtgtgtgtgt 
taatagtatt 
agagctatta 
tggtgtaacc 
gaaaggttgt 
tttttataag 
actagttttg 
gtattagata 
gtgattttgt 
gtatgaataa 
atctgagaga 
gtcccgcgtg
ataatctttt 
catcaaacta 
tcaagcaata 
tcctgataat 


gtttaagcca 


tggaacagga 


tgctggtaag 


gtttgaccaa 


tttcattcct 


24060 


24120 


24180 


24240 


24300 


24360 


24420 


24480 


24540 


24600 


24660 


24720 


24780 


24840 


24900 


24960 


25020 


25080 


25140 


25200 


25260 


25320 


25380 


25440 


25500 


25560 


25620 


25680 


25740 


25800 


25860 


25920 


25980 


26040 


26100 


26160 


26220 


26280 


26340 


26400 




US 10,130,701 B2 




ctgaatcgtg gcaggagtgg gagatcaaca gcagcttcat cagcagcatc tagtagagca   26460
ccatcacgtg aagtttcgcg tggtcgcagg agtggttctg aagatgatct tattgctcgt   26520
gcagcaagga taattcagga tcagcagaag aagggttctc gcattacaaa ggctaaggct   26580
gatgaaatgg ctcaccgccg gtattgcaag cgcactattc cacctaatta taaggttgat   26640
caagtgtttg gtccccgtac taaaggtaag gagggaaatt ttggtgatga caagatgaat   26700
gaggaaggta ttaaggatgg gcgcgttaca gcaatgctca acctagttcc tagcagccat   26760
gcttgtcttt tcggaagtag agtgacgccc agacttcaac cagatgggct gcacttgaaa   26820
tttgaattta ctactgtggt cccacgtgat gatccgcagt ttgataatta tgtaaaaatt   26880
tgtgatcagt gtgttgatgg tgtaggaaca cgtccaaaag atgatgaacc aagaccaaag   26940
tcacgctcaa gttcaagacc tgcaacaaga ggaaattctc cagcgccaag acagcagcgc   27000
cctaagaagg agaaaaagcc aaagaagcag gatgatgaag tggataaagc attgacctca   27060
gatgaggaga ggaacaatgc acagctggaa tttgatgatg aacccaaggt aattaactgg   27120
ggggattcag ccctaggaga gaatgaactt tgagtaaaat tcaatagtaa gagttaagga   27180
agataggcat gtagcttgat tacctacatg tctatcgcca gggaaatgtc taatttgtct   27240
acttagtagc ctggaaacga acggtagacc cttagatttt aatttagttt aatttttagt   27300
ttagtttaag ttagtttaga gtaggtataa agatgccagt ассаадасса cgcggagtac   27360
gaccgagggt acagcactag gacgcccatt aggggaagag ctaaatttta gtttaagtta   27420
agtttaattg gctatgtata gttaaaattt ataggctagt atagagttag agcaaaaaaa   27480
aaaaaaaaaa aaaaaaaaaa                                               27500

<210> SEQ ID NO 2
<211> LENGTH: 435
<212> TYPE: DNA
<213> ORGANISM: Infectious bronchitis virus
<400> SEQUENCE: 2

tctaaaggtc atgagacaga ggaagtggat gctgtaggca ttctctcact ttgttctttt   60
gcagtagatc ctgcggatac atattgtaaa tatgtggcag caggtaatca acctttaggt   120
aactgtgtta aaatgttgac agtacataat ggtagtggtt ttgcaataac atcaaagcca   180
agtccaactc cggatcagga ttcttatgga ggagcttctg tgtgtcttta ttgtagagca   240
catatagcac accttggcgg agcaggaaat ttagatggac gctgtcaatt taaaggttct   300
tttgtgcaaa tacctactac ggagaaagat cctgttggat tctgtctacg taacaaggtt   360
tgcactgttt gtcagtgttg gattggttat ggatgtcagt gtgattcact tagacaacct   420
aaaccttctg ttcag                                                    435

<210> SEQ ID NO 3
<211> LENGTH: 1563
<212> TYPE: DNA
<213> ORGANISM: Infectious bronchitis virus
<400> SEQUENCE: 3

ggtacaggct tgtttaaaat ttgcaacaaa gagtttagtg gtgttcaccc agcttatgca   60
gtcacaacta aggctcttgc tgcaacttat aaagttaatg atgaacttgc tgcacttgtt   120
aacgtggaag ctggttcaga aataacatat aaacatctta tttctttgtt agggtttaag   180
atgagtgtta atgttgaagg ctgccacaac atgtttataa cacgtgatga ggctatccgc   240


aacgtaagag gttgggtagg ttttgatgta gaagcaacac atgcttgcgg tactaacatt   300
ggtactaacc tgcctttcca agtaggtttc tctactggtg cagactttgt agttacgcct   360
gagggacttg tagatacttc aataggcaat aattttgagc ctgtgaattc taaagcacct   420
ccaggtgaac aatttaatca cttgagagcg ttattcaaaa gtgctaaacc ttggcatgtt   480
gtaaggccaa ggattgtgca aatgttagcg gataacctgt gcaacgtttc agattgtgta   540
gtgtttgtca cgtggtgtca tggcctagaa ctaaccactt tgcgctattt tgttaaaata   600
ggcaaggacc aagtttgttc ttgcggttct agagcaacaa cttttaattc tcatactcag   660
gcttatgctt gttggaagca ttgcttgggt tttgattttg tttataatcc actcttagtg   720
gatattcaac agtggggtta ttctggtaac ctacaattta accatgattt gcattgtaat   780
gtgcatggac acgcacatgt agcttctgcg gatgctatta tgacgcgttg tcttgcaatt   840
aataatgcat tttgtcaaga tgtcaactgg gatttaactt accctcatat agcaaatgag   900
gatgaagtca attctagctg tagatattta caacgcatgt atcttaatgc atgtgttgat   960
gctcttaaag ttaacgttgt ctatgatata ggcaacccta aaggtattaa atgtgttaga   1020
cgtggagact taaattttag attctatgat aagaatccaa tagtacccaa tgtcaagcag   1080
tttgagtatg actataatca gcacaaagat aagtttgctg atggtctttg tatgttttgg   1140
aattgtaatg tggattgtta tcccgacaat tccttacttt gtaggtacga cacacgaaat   1200
ttgagtgtgt ttaacctacc tggttgtaat ggtggtagct tgtatgttaa caagcatgca   1260
ttccacacac ctaaatttga tcgcactagc tttcgtaatt tgaaagctat gccattcttt   1320
ttctatgact catcgccttg cgagaccatt caattggatg gagttgcgca agaccttgtg   1380
tcattagcta cgaaagattg tatcacaaaa tgcaacatag gcggtgctgt ttgtaaaaag   1440
cacgcacaaa tgtatgcaga ttttgtgact tcttataatg cagctgttac tgctggtttt   1500
actttttggg ttactaataa ttttaaccca tataatttgt ggaaaagttt ttcagctctc   1560
cag                                                                 1563

<210> SEQ ID NO 4
<211> LENGTH: 1014
<212> TYPE: DNA
<213> ORGANISM: Infectious bronchitis virus
<400> SEQUENCE: 4

tctatcgaca atattgctta taatatgtat aagggtggtc attatgatgc tattgcagga   60
gaaatgccca ctatcgtaac tggagataaa gtttttgtta tagatcaagg сасадааааа   120
gcagtttttt ttaatcaaac aattctgcct acatctgtag cgtttgagct gtatgcgaag   180
agaaatattc gcacactgcc aaacaaccgt attttgaaag gtttgggtgt agatgtgact   240
aatggatttg taatttggga ttacacgaac caaacaccac tataccgtaa tactgttaag   300
gtatgtgcat atacagacat agaaccaaat ggcctaatag tgctgtatga tgatagatat   360
ggtgattacc agtcttttct agctgctgat aatgctgttt tagtttctac acagtgttac   420
aagcggtatt cgtatgtaga aataccgtca aacctgcttg ttcagaacgg tattccgtta   480
aaagatggag cgaacctgta tgtttataag cgtgttaatg gtgcgtttgt tacgctacct   540
aacacaataa acacacaggg tcgaagttat gaaacttttg aacctcgtag tgatgttgag   600
cgtgattttc tcgacatgtc tgaggagagt tttgtagaaa agtatggtaa agaattaggt   660
ctacagcaca tactgtatgg tgaagttgat aagccccaat taggtggttt ccacactgtt   720
ataggtatgt gcagactttt acgtgcgaat aagttgaacg caaagtctgt tactaattct   780


gattctgatg


tgtactgttg 


gagtatggta 


tttatgactt 


tcatgcaaaa 


tggatttgct 


ctaataagtc 


ggtttgaaga 


<210> SEQ ID NO 5
<211> LENGTH: 909
<212> TYPE: DNA
<213> ORGANISM:
<400> SEQUENCE: 5


tcagcatgga 
gaaccttgca 
aatgtggcaa 
cataatatgc 
actgttctta 
tatgtgtctg 
aagtttgatc 
ggcgtgatag 
aataatttgg 
gttttatatg 
gcctcttctt
aaggttagtg 
caaacctctg 
ccagttgtta 
ggtaagttac 


actatgtag 


cgtgtggtta 
acattcctaa 
agtatacaca 
gagtaatgca 
aacaatggct 
atgcacatgt 
ttgtgatatc 
ccaataatgg 
ctctaggtgg 
acattgcaca 
cagaagcatt 
gaaaaacgct 
cttatagtat 
atttgaaaac 


tggtaagaga 


<210> SEQ ID NO 6
<211> LENGTH: 145
<212> TYPE: PRT
<213> ORGANISM:
<400> SEQUENCE: 6




ttattttgta 
gcttgatgat
taaagttgta 


tggcattatt 


taatatgcct 
ttatggtgtt 
actctgtcaa 
ttttggagct 
cccagaaggg 
ttctgtgctt 
tgatatgtat 
caatgatgac 
tagttttgct 
ggattgtgca 
cttgattggt 
gcacgcaaat 
atttgacgtt 
tgaacaaaag 


tgttggtaac 


Ser Lys Gly His Glu Thr Glu Glu 


Leu Cys Ser 


Phe Ala Val Asp Pro 


20 


Ala Ala Gly Asn Gln Pro Leu Gly 


35 


40 


His Asn Gly Ser Gly Phe Ala Ile 


50 


Asp Gln Asp 


65 


His Ile Ala 


55 


Ser Tyr Gly Gly Ala 


70 


His Pro Gly Gly Ala 


85 


Phe Lys Gly Ser Phe Val Gln Ile 


Gly Phe Cys 


100 


Leu Arg Asn Lys Val
ttggcagaca


ttcttagaac 


acagtgtcaa 


aaaacatgtt 


Infectious bronchitis virus 


gaactttata 
ggaatagcgt 
tacctttcga 
ggaagtgaca 
acactccttg 
tcagattgca 
acagacaatg 
gttttcatat 
gtaaaagtga 
tggtggacaa 
gttaattatt 
tatatatttt 
gctaagtttg 
acagacttag 


acctctttta 


Infectious bronchitis virus 


Val Asp Ala 
10 


Ala Asp Thr 
25 


Asn Cys Val 


Thr Ser Lys 


Ser Val Cys 
75 


Gly Asn Leu 
90 


Pro Thr Thr 
105 


Cys Thr Val 




atggttccta 
ttcttaggaa 
ttgattacca 


atccacagct 


aagttcagaa 


tgccaagtgg 
aaacaacaat 
aaggagtggt 
tcgataatga 
ataaatataa 
attcaaaaag 
atctctcaag 
cagagacaag 
tgttttgtac 
tgggtgcaag 
ggaggaattg 
atttgagatt 
tctttaattt 


ctagtgactc 


caagcaagtg 
catactgaaa 
tagcataaat 


tcaa 


ttgtgttatg 
tattatgatg 
gtgtgtaccg 
gccaggtagt 
tattgtagac 
gacagagcac 
aaagcatgaa 
ttttcttcgt 
ttggcacgaa 
agcagtgaat 
tgaaaaggtt 
taattattta 
gaaagcaacg 
aattaagtgt 


ttttgtgtgt 


Val Gly Ile Leu Ser 


Tyr Cys Lys 


30 


15 


Tyr Val 


Lys Met Leu Thr Val 


45 


Pro Ser Pro Thr Pro 


60 


Leu Tyr Cys 


Arg Ala 
80 


Asp Gly Arg Cys Gln 


95 


Glu Lys Asp Pro Val 
110 


Cys Gln Cys 


Trp Ile 


840 


900 


960 


1014 


60 


120 


180 


240 


300 


360 


420 


480 


540 


600 


660 


720 


780 


840 


900 


909 




115 




120


125 


Gly Tyr Gly Cys Gln Cys Asp Ser Leu Arg Gln Pro Lys Pro Ser Val 


Gln 
145 


130


<210> SEQ ID NO 7
<211> LENGTH:
<212> TYPE: PRT
<213> ORGANISM:
<400> SEQUENCE:


Gly 


T 


Asn 


Thr 


Val 


65 


Asn 


Gly 


Gly 


Gly 


Phe 


145 


Val 


Ser 


Thr 


Gly 


Trp 


225 


Asp 


Leu 


Ile 


Asn 


Ser 


305 


Ala 


Thr 


Ala 


Asp 


Tyr 


50 


Glu 


Val 


Thr 


Ala 


Asn 


130 


Asn 


Arg 


Asp 


Leu 


Ser 


210 


Lys 


Ile 


His 


Met 


Trp 
290 


Ser 


Leu 


521 


135 


140 


Infectious bronchitis virus 


7 


Gly Leu Phe 


Tyr 


Glu 


35 


Lys 


Gly 


Arg 


Asn 


Asp 


115 


Asn 


His 


Pro 


Cys


Arg 


195 


Arg 


His 


Gln 


Cys 


Thr 
275 
Asp 


Cys 


Lys 


Ala 


20 


Leu 


His 


Cys 


Gly 


Ile 


100 


Phe 


Phe 


Leu 


Arg 


Val 


180 


Tyr 


Ala 


Cys 


Gln 


Asn 


260 


Arg 


Leu 


Arg 


Val 


5 


Ala 


Leu 


His 


Trp 


85 


Gly 


Val 


Glu 


Arg 


Ile 


165 


Val 


Phe 


Thr 


Leu 


Trp 


245 


Val 


Cys 


Thr 


Tyr 


Asn 
325 


Lys 


Thr 


Ala 


Ile 


Asn 


70 


Val 


Thr 


Val 


Pro 


Ala 


150 


Val 


Phe 


Val 


Thr 


Gly 


230 


Gly 


His 


Leu 


Tyr 


Leu 
310 


Val 


Ile 


Thr 


Leu 


Ser 


55 


Met 


Gly 


Asn 


Thr 


Val 


135 


Leu 


Gln 


Val 


Lys 


Phe 


215 


Phe 


Tyr 


Gly 


Ala 


Pro 
295 


Gln 


Val 


Cys 


Lys 


Val 


40 


Leu 


Phe 


Phe 


Leu 


Pro 


120 


Asn 


Phe 


Met 


Thr 


Ile 


200 


Asn 


Asp 


Ser 


His 


Ile 


280 


His 


Arg 


Tyr 


Asn 


Ala 


25 


Asn 


Leu 


Ile 


Asp 


Pro 


105 


Glu 


Ser 


Lys 


Leu 


Trp 


185 


Gly 


Ser 


Phe 


Gly 


Ala 


265 


Asn 


Ile 


Met 


Asp 


Lys Glu Phe 
10 


Leu Ala Ala 


Val Glu Ala 


Gly Phe Lys 


60 


Thr Arg Asp 
75 


Val Glu Ala 
90 


Phe Gln Val 


Gly Leu Val 


Lys Ala Pro 


140 


Ser Ala Lys 
155 


Ala Asp Asn 
170 


Cys His Gly 


Lys Asp Gln 


His Thr Gln 


220 


Val Tyr Asn 
235 


Asn Leu Gln 
250 


His Val Ala 


Asn Ala Phe 


Ala Asn Glu 


300 


Tyr Leu Asn 
315 


Ile Gly Asn 
330 


Ser 


Thr 


Gly 


45 


Met 


Glu 


Thr 


Gly 


Asp 


125 


Pro 


Pro 


Leu 


Leu 


Val 


205 


Ala 


Pro 


Phe 


Ser 


Cys 


285 


Asp 


Ala 


Pro 


Gly 


Tyr 


30 


Ser 


Ser 


Ala 


His 


Phe 


110 


Thr 


Gly 


Trp 


Cys 


Glu 
190 


Cys 


Tyr 


Leu 


Asn 


Ala 


270 


Gln 


Glu 


Cys 


Lys 


Val 


15 


Lys 


Glu 


Val 


Ile 


Ala 


95 


Ser 


Ser 


Glu 


His 


Asn 


175 


Leu 


Ser 


Ala 


Leu 


His 


255 


Asp 


Asp 


Val 


Val 


Gly 
335 


His 


Val 


Ile 


Asn 


Arg 


80 


Cys 


Thr 


Ile 


Gln 


Val 


160 


Val 


Thr 


Cys 


Cys 


Val 


240 


Asp 


Ala 


Val 


Asn 


Asp 


320 


Ile 




Lys 


Pro 


Lys 


Asp 


385 


Leu 


Asn 


Asn 


Thr 


Lys 


465 


His 


Thr 


Leu 


Cys 


Ile 


Asp 


370 


Cys 


Ser 


Lys 


Leu 


Ile 


450 


Asp 


Ala 


Ala 


Trp 


Val 


Val 


355 


Lys 


Tyr 


Val 


His 


Lys 


435 


Gln 


Cys 


Gln 


Gly 


Lys 
515 


Arg 


340 


Pro 


Phe 


Pro 


Phe 


Ala 


420 


Ala 


Leu 


Ile 


Met 


Phe 


500 


Ser 


Arg 


Asn 


Ala 


Asp 


Asn 


405 


Phe 


Met 


Asp 


Thr 


Tyr 


485 


Thr 


Phe


<210> SEQ ID NO 8
<211> LENGTH:
<212> TYPE: PRT
<213> ORGANISM:
<400> SEQUENCE:


Ser Ile Asp Asn 


1 


Val 


Leu 


Thr 


65 


Asn 


Asn 


Ile 


Ala 


Tyr 
145 


Lys 


Val 


Ile 


Ile 


Pro 


50 


Leu 


Gly 


Thr 


Val 


Asp 


130 


Val 


Asp 


Thr 


Ala 


Asp 


35 


Thr 


Pro 


Phe 


Val 


Leu 


115 


Asn 


Glu 


Gly 


Leu 


Gly 


20 


Gln 


Ser 


Asn 


Val 


Lys 


100 


Tyr 


Ala 


Ile 


Ala 


Pro 
180 


338 


Gly 


Val 


Asp 


Asn 


390 


Leu 


His 


Pro 


Gly 


Lys 


470 


Ala 


Phe 


Ser 




Asp 
Lys 
Gly 
375
Ser 
Pro 
Thr 
Phe 
Val 
455 
Cys 
Asp 


Trp 


Ala 


Leu 


Gln 


360 


Leu 


Leu 


Gly 


Pro 


Phe 


440 


Ala 


Asn 


Phe 


Val 


Leu 
520 


Asn 


345 


Phe 


Cys 


Val 


Cys 


Lys 


425 


Phe 


Gln 


Ile 


Val 


Thr 


505 


Gln 


Phe 


Glu 


Met 


Cys 


Asn 


410 


Phe 


Tyr 


Asp 


Gly 


Thr 


490 


Asn
Arg


Tyr 


Phe 


Arg 


395 


Gly 


Asp 


Asp 


Leu 


Gly 


475 


Ser 


Asn 


Infectious bronchitis virus 


8 


Gly 


Val 


Asn 


Ile 


85 


Val 


Asp 


Val 


Pro 


Asn 
165 


Asn 


Ala 


Met 


Val 


Ala 


Arg 


70 


Trp 


Cys 


Asp 


Leu 


Ser 
150 


Leu 


Thr 


Tyr 


Pro 


Glu 


Phe 


55 


Ile 


Asp 


Ala 


Arg 


Val 
135 
Asn 


Tyr 


Leu 


Asn 


Thr 


Lys 


40 


Glu 


Leu 


Tyr 


Tyr 


Tyr 


120 


Ser 


Leu 


Val 


Asn 


Met 


Ile 


25 


Ala 


Leu 


Lys 


Thr 


Thr 


105 


Gly 


Thr 


Leu 


Tyr 


Thr 
185 


Tyr 


10 


Val 


Val 


Tyr 


Gly 


Asn 


90 


Asp 


Asp 


Gln 


Val 


Lys 
170 


Gln 


Lys 


Thr 


Phe 


Ala 


Leu 


75 


Gln 


Ile 


Tyr 


Cys 
Gln 
155 


Arg 


Gly 


-continued 


Phe 


Asp 


Trp 


380 


Tyr 


Gly 


Arg 


Ser 


Val 


460 


Ala 


Tyr 


Phe 


Gly 


Gly 


Phe 


Lys 


60 


Gly 


Thr 


Glu 


Gln 


Tyr 


140 


Asn 


Val 


Arg 


Tyr 


Tyr 


365 


Asn 


Asp 


Ser 


Thr 


Ser 


445 


Ser 


Val 


Asn 


Asn 


Gly 


Asp 


Asn 


45 


Arg 


Val 


Pro 


Pro 


Ser 


125 


Lys 


Gly 


Asn 


Ser 


Asp 
350 


Asn 


Cys 


Thr 


Leu 


Ser 


430 


Pro 


Leu 


Cys 


Ala 


Pro 
510 


His 


Lys 


30 


Gln 


Asn 


Asp 


Leu 


Ile 


Gly 


Tyr 
190 


Lys 


Gln 


Asn 


Cys 


Ala 


Lys 


Ala 
495 


Tyr 


Tyr 


15 


Val 


Thr 


Ile 


Val 


Tyr 


95 


Gly 


Leu 


Tyr 


Pro 


Ala 


15 


Glu 


Asn 


His 


Val 


Glu 


Thr 


Lys 


480 


Val 


Asn 


Asp 


Phe 


Ile 


Leu 


Ala 


Ser 


Leu 


160 


Phe 


Thr 


84 


Phe 


Glu 


Leu 


225 


Ile 


Val 


Asp 


Asp 


Asn 


305 


Phe 


Leu 


<210> 
<211> 
<212> 
<213> 


<400> 


Glu 


Ser 


210 


Tyr 


Gly 


Thr 


Asn 


Asp 


290 


Lys 


Met 


Gln 


Pro 


195 


Phe 


Gly 


Met 


Asn 


Gly 


275 


Phe 


Ser 


Thr 


Arg 


Val 


Glu 


Cys 


Ser 


260 


Ser 


Leu 


Lys 


Trp 


ORGANISM: 


SEQUENCE: 


Ser Ala Trp Thr 


1 


Ala 


Cys 


Val 


65 


Thr 


Asp 


Cys 


Met 


Asn 


145 


Asn 


Ser 


Thr 


Val 


Cys 


Leu 


Gln 


50 


Met 


Val 


Ile 


Asn 


Tyr 


130 


Asn 


Asn 


Trp 


Met 


Gly 
210 


Val 


Pro 


35 


Tyr 


His 


Leu 


Val 


Lys 


115 


Thr 


Gly 


Leu 


His 


Phe 
195 


Val 


Met 


20 


Ser 


Leu 


Phe 


Lys 


Asp 


100 


Tyr 


Asp 


Asn 


Ala 


Glu 
180 


Cys 


Asn 


Ser Asp 


Glu Lys 


Val Asp 
230 


Arg Leu 
245 


Asp Ser 


Tyr Lys 


Glu Leu 


Val Val 


310 


Phe Glu 
325 


SEQ ID NO 9 
LENGTH: 
TYPE: PRT 


302 


85 


Val 
Tyr 
215 
Lys 
Leu 
Asp 
Gln 
Leu 
295 


Thr 


Asp 


Glu 


200 


Gly 


Pro 


Arg 


Val 


Val 


280 


Arg 


Val 


Gly 


Arg 


Lys 


Gln 


Ala 


Met 


265 


Cys 


Asn 


Ser 


Ile 


Asp 


Glu 


Leu 


Asn 


250 


Gln 


Thr 


Ile 


Ile 


Ile 
330
Phe


Leu 


Gly 


235 


Lys 


Asn 


Val 


Leu 


Asp 


315 


Lys 


Infectious bronchitis virus 


9 


Cys Gly 
5 


Glu Pro 


Gly Ile 


Ser Lys 


Gly Ala 


70 


Gln Trp 
85 


Tyr Val 


Lys Thr 


Asn Asp 


Asp Asp 


150 
Leu Gly 
165 
Val Leu 


Thr Ala 


Tyr Leu 


Tyr 


Cys 


Met 


Thr 


55 


Gly 


Leu 


Ser 


Glu 


Ser 


135 


Val 


Gly 


Tyr 


Val 


Gly 
215 


Asn 


Asn 


Met 


40 


Thr 


Ser 


Pro 


Asp 


His 


120 


Lys 


Phe 


Ser 


Asp 


Asn 
200 


Ala 


Met 


Ile


25 


Asn 


Met 


Asp 


Glu 


Ala 


105 


Lys 


Arg 


Ile 


Phe 


Ile 
185 


Ala 


Ser 


Pro 


10 


Pro 


Val 


Cys 


Lys 


Gly 


90 


His 


Phe 


Lys 


Tyr 


Ala 
170 
Ala 


Ser 


Glu 


Glu 


Asn 


Ala 


Val 


Gly 


75 


Thr 


Val 


Asp 


His 


Leu 


155 


Val 


Gln 


Ser 


Lys 


-continued 


Leu 


Gly 


220 


Gly 


Leu 


Tyr 


Val 


Lys 


300 


Tyr 


Thr 


Leu 


Tyr 


Lys 


Pro 


60 


Val 


Leu 


Ser 


Leu 


Glu 


140 


Ser 


Lys 


Asp 


Ser 


Val 
220 


Asp 


205 


Leu 


Leu 


Asn 


Phe 


Asp 


285 


Glu 


His 


Cys 


Tyr 


Gly 


Tyr 


45 


His 


Ala 


Leu 


Val 


Val 


125 


Gly 


Ser 


Val 


Cys 


Glu 
205 


Lys 


Met 


Gln 


His 


Ala 


Val 


270 


Leu 


Tyr 


Ser 


Tyr 


Lys 


Val 


30 


Thr 


Asn 


Pro 


Val 


Leu 


110 


Ile 


Val 


Phe 


Thr 


Ala 
190 


Ala 


Val 


Ser 


His 


Thr 


Lys 


255 


Leu 


Leu 


Gly 


Ile 


Pro 
335 


Val 


15 


Gly 


Gln 


Met 


Gly 


Asp 


95 


Ser 


Ser 


Ile 


Leu 


Glu 
175 
Trp 


Phe 


Ser 


Glu 


Ile 


Val 


240 


Ser 


Ala 


Leu 


Thr 


Asn 


320 


Gln 


Gln 


Ile 


Leu 


Arg 


Ser 


80 


Asn 


Asp 


Asp 


Ala 


Arg 


160 


Thr 


Trp 


Leu 


Gly 


86 


Lys 


225 


Gln 


Leu 


Leu 


Gly 


<210> 
<211> 
<212> 
<213> 
<220> 
<223> 


<400> 


Thr 


Thr 


Lys 


Val 


Asn 
290 


Leu 


Ser 


Ala 


Phe 


275 


Thr 


His 


Ala 


Thr 


260 


Asn 


Ser 


PRT 


SEQUENCE:


Ser Lys Gly His 


1 


Ala 


His 


Asp 


65 


His 


Phe 


Gly 


Gly 


Gln 
145 


<210> 
<211> 
<212> 
<213> 
<220> 
<223> 


Cys 


Ala 


Asn 


50 


Gln 


Ile 


Lys 


Phe 


Tyr 
130 


Ser 


Gly 


35 


Gly 


Asp 


Ala 


Gly 


Cys 


115 


Gly 


Phe 


20 


Asn 


Ser 


Ser 


His 


Ser 


100 


Leu 


Cys 


PRT 


<400> SEQUENCE:


Gly
1


Thr 


Val 


Thr 


Ala 


Glu 


Ala 


Tyr 


245 


Pro 


Leu 


Phe 


SEQ ID NO 10
LENGTH: 
TYPE: 
ORGANISM: Artificial Sequence 
FEATURE: 
OTHER INFORMATION: Mutated Nsp10 sequence 


145 


10 


Gln 


Gly 


Tyr 


Leu 


85 


Phe 


Arg 


Gln 


SEQ ID NO 11 
LENGTH:
TYPE: 
ORGANISM: Artificial Sequence 
FEATURE: 
OTHER INFORMATION: Mutated Nsp14 sequence 


521 


11 


Gly Leu Phe 


Tyr 
Glu 
35 


Lys 


Gly 


Ala 


His 


Cys 


5 


His 


Asn 
230 
Ser 
Val 


Ile 


Thr 


Thr 


Val 


Pro 


Phe 


Gly 


70 


Gly 


Val 


Asn 


Cys 


Lys 


Thr 


Ala 


Ile 


Asn 


87 


Tyr 
Ile 
Val 
Lys 


Ser 
295 


Glu 
Asp 
Leu 
Ala 
55 

Gly 
Gly 
Gln 


Lys 


Asp 
135 


Ile 
Thr 
Leu 
Ser 


55 


Met 


Ile 


Phe 


Asn 


Cys 


280 


Asp 


Glu 


Pro 


Gly 


40 


Ile 


Ala 


Ala 


Ile 


Val 


120 


Ser 


Cys 


Lys 


Val 


40 


Leu 


Phe 


Phe 
Asp 
Leu 
265 


Gly 


Ser 


Val 


Ala 


25 


Asn 


Thr 


Ser 


Gly 


Pro 


105 


Cys 


Leu 


Asn 


Ala 


25 


Asn 


Leu 


Ile 


US 10,130,701 B2 


Trp Arg 
235 


Val Ala 
250 
Lys Thr 


Lys Leu 


Phe Val 


Asp Ala 
10 

Asp Thr 
Cys Val 
Ser Lys 
Val Cys 

75 

Asn Leu 
90 

Thr Thr 


Thr Val 


Arg Gln 


Lys Glu 


10 


Leu Ala 


Val Glu 


Gly Phe 


Thr Arg 


-continued 


Asn Cys 


Lys Phe 


Glu Gln 


Leu Val 
285 


Cys Thr 
300 


Val Gly 


Tyr Cys 


Lys Met 
45 


Pro Ser 
60 


Leu Tyr 


Asp Gly 


Glu Lys 


Cys Gln 
125 


Pro Lys 
140 


Phe Ser 


Ala Thr 


Ala Gly 
45 


Lys Met 
60 


Asp Glu 


Asn 


Asp 


Lys 


270 


Arg 


Met 


Ile 


Lys 


30 


Leu 


Pro 


Cys 


Arg 


Asp 


110 


Cys 


Pro 


Gly 


Tyr 


30 


Ser 


Ser 


Ala 


Tyr 
Leu 
255 


Thr 


Asp 


Leu 
15 


Tyr 


Thr 


Thr 


Arg 


Cys 


95 


Pro 


Trp 


Ser 


Val 


15 


Lys 


Glu 


Val 


Ile 


Leu 
240 
Arg 


Asp 


Val 


Ser 


Val 


Val 


Pro 


Ala 


80 


Gln 


Val 


Ile 


Val 


His 


Val 


Ile 


Asn 


Arg 


88 


65 


Asn 


Gly 


Gly 


Gly 


Phe 


145 


Val 


Ser 


Thr 


Gly 


Trp 


225 


Asp 


Leu 


Ile 


Asn 


Ser 


305 


Ala 


Lys 


Pro 


Lys 


Asp 


385 


Leu 


Asn 


Asn 


Thr 


Lys 
465 


His 


Val 


Thr 


Ala 


Asn 


130 


Asn 


Arg 


Asp 


Leu 


Ser 


210 


Lys 


Ile 


His 


Met 


Trp 


290 


Ser 


Leu 


Cys 


Ile 


Asp 


370 


Cys


Ser 


Lys 


Leu 


Ile 
450 


Asp 


Ala 


Arg 


Asn 


Asp 


115 


Asn 


His 


Pro 


Cys 


Arg 


195 


Arg 


His 


Gln 


Cys 


Thr 


275 


Asp 


Cys 


Lys 


Val 


Val 


355 


Lys 


Tyr 


Val 


His 


Lys 


435 


Gln 


Cys 


Gln 


Gly 


Ile 


100 


Phe 


Phe 


Leu 


Arg 


Val 


180 


Tyr 


Ala 


Cys 


Gln 


Asn 


260 


Arg 


Leu 


Arg 


Val 


Arg 


340 


Pro 


Phe 


Pro 


Phe 


Ala 


420 


Ala 


Leu 


Ile 


Met 


Тер. 


85 


Gly 


Val 


Glu 


Arg 


Ile 


165 


Val 


Phe 


Thr 


Leu 


Trp 


245 


Val 


Cys 


Thr 


Tyr 


Asn 


325 


Arg 


Asn 


Ala 


Asp 


Asn 


405 


Phe 


Met 


Asp 


Thr 


Tyr 
485 


70 


Val 


Thr 


Val 


Pro 


Ala 


150 


Val 


Phe 


Val 


Thr 


Gly 


230 


Gly 


His 


Leu 


Tyr 


Leu 


310 


Val 


Gly 


Val 


Asp 


Asn 


390 


Leu 


His 


Pro 


Gly 


Lys 


470 


Ala 


89 


Gly 
Asn 
Thr 
Val 
135 
Leu 
Gln 
Val 
Lys 
Phe 
215 
Phe 
Tyr 
Gly 
Ala 
Pro 
295 
Gln 
Val 
Asp 
Lys 
Gly 
375 
Ser 
Pro 
Thr 


Phe 


Val 
455 


Cys 


Asp 


Phe 


Leu 


Pro 


120 


Asn 


Phe 


Met 


Thr 


Ile 


200 


Asn 


Asp 


Ser 


His 


Ile


280 


His 


Arg 


Tyr 


Leu 


Gln 


360 


Leu 


Leu 


Gly 


Pro 


Phe 
440 
Ala 


Asn 


Phe 


Asp 


Pro 


105 


Glu 


Ser 


Lys 


Leu 


Trp 


185 


Gly 


Ser 


Phe 


Gly 


Ala 


265 


Asn 


Ile 


Met 


Asp 


Asn 


345 


Phe 


Cys 


Leu 


Cys 


Lys 


425 


Phe 


Gln 


Ile 


Val 


Val 


90 


Phe 


Gly 


Lys 


Ser 


Ala 
170 


Cys 


Lys 


His 


Val 


Asn 


250 


His 


Asn 


Ala 


Tyr 


Ile 


330 


Phe 


Glu 


Met 


Cys 


Asn 


410 


Phe 


Tyr 


Asp 


Gly 


Thr 
490
75


Glu 


Gln 


Leu 


Ala 


Ala 


155 


Asp 


His 


Asp 


Thr 


Tyr 


235 


Leu 


Val 


Ala 


Asn 


Leu 


315 


Gly 


Arg 


Tyr 


Phe 


Arg 


395 


Gly 


Asp 


Asp 


Leu 


Gly 
475 


Ser 


-continued 


Ala 


Val 


Val 


Pro 


140 


Lys 


Asn 


Gly 


Gln 


Gln 


220 


Asn 


Gln 


Ala 


Phe 


Glu 


300 


Asn 


Asn 


Phe 


Asp 


Trp 


380 


Tyr 


Gly 


Arg 


Ser 


Val 
460 


Ala 


Tyr 


Thr 


Gly 


Asp 


125 


Pro 


Pro 


Leu 


Leu 


Val 


205 


Ala 


Pro 


Phe 


Ser 


Cys 


285 


Asp 


Ala 


Pro 


Tyr 


Tyr 


365 


Asn 


Asp 


Ser 


Thr 


Ser 
445 
Ser 


Val 


Asn 


His 


Phe 


110 


Thr 


Gly 


Trp 


Cys 


Glu 
190 


Cys 


Tyr 


Leu 


Asn 


Ala 


270 


Gln 


Glu 


Cys 


Lys 


Asp 


350 


Asn 


Cys 


Thr 


Leu 


Ser 


430 


Pro 


Leu 


Cys 


Ala 


Ala 


95 


Ser 


Ser 


Glu 


His 


Asn 


175 


Leu 


Ser 


Ala 


Leu 


His 


255 


Asp 


Asp 


Val 


Val 


Gly 


335 


Lys 


Gln 


Asn 


Cys 


Ala 


Lys 


Ala 
495 


80 


Cys 


Thr 


Ile 


Gln 


Val 


160 


Val 


Thr 


Cys 


Cys 


Val 


240 


Asp 


Ala 


Val 


Asn 


Asp 


320 


Ile 


Asn 


His 


Val 


Glu 


Thr 


Lys 


480 


Val 


90 


91 




-continued 


Thr Ala Gly Phe Thr Phe Trp Val Thr Asn Asn Phe Asn Pro Tyr Asn 


500 


505 


Leu Trp Lys Ser Phe Ser Ala Leu Gln 


<210> 
<211> 
<212> 
<213> 
<220> 
<223> 


515 


PRT 


<400> SEQUENCE:


Ser


1


Val 


Leu 


Thr 


65 


Asn 


Asn 


Ile 


Ala 


Tyr 


145 


Lys 


Val 


Phe 


Glu 


Leu 


225 


Ile 


Val 


Asp 


Asp 


Asn 


305 


Phe 


Ile 


Ile 


Ile 


Pro 


50 


Leu 


Gly 


Thr 


Val 


Asp 


130 


Val 


Asp 


Thr 


Glu 


Ser 


210 


Tyr 


Gly 


Thr 


Asn 


Asp 
290 


Lys 


Met 


SEQ ID NO 12
LENGTH:
TYPE:
ORGANISM: Artificial Sequence
FEATURE:
OTHER INFORMATION: Mutated Nsp15 sequence


338 


12 


Asp Asn Ile 


Ala 


Asp 


35 


Thr 


Pro 


Phe 


Val 


Leu 


115 


Asn 


Glu 


Gly 


Leu 


Pro 


195 


Phe 


Gly 


Met 


Asn 


Gly 


275 


Phe 


Ser 


Thr 


Gly 


20 


Gln 


Ser 


Asn 


Val 


Lys 


100 


Tyr 


Ala 


Ile


Ala 


Pro 


180 


Arg 


Val 


Glu 


Cys 


Ser 


260 


Ser 


Leu 


Lys 


Trp 


5 


Gly 


Val 


Asn 


Ile 


85 


Val 


Asp 


Val 


Pro 


Asn 


165 


Asn 


Ser 


Glu 


Val 


Arg 


245 


Asp 


Tyr 


Glu 


Val 


Phe 
325 


Ala 


Met 


Val 


Ala 


Arg 


70 


Trp 


Cys 


Asp 


Leu 


Ser 


150 


Leu 


Thr 


Asp 


Lys 


Asp 


230 


Leu 


Ser 


Lys 


Leu 


Val 


310 


Glu 


Tyr 


Pro 


Glu 


Phe 


55 


Ile 


Asp 


Ala 


Arg 


Val 


135 


Asn 


Tyr 


Ile 


Val 


Tyr 


215 


Lys 


Leu 


Asp 


Gln 


Leu 
295 


Thr 


Asp 


520 


Asn 


Thr 


Leu 


Tyr 


Tyr 


Tyr 


120 


Ser 


Leu 


Val 


Asn 


Glu 


200 


Gly 


Pro 


Arg 


Val 


Val 
280 
Arg 


Val 


Gly 


Met 


Ile 


25 


Ala 


Leu 


Lys 


Thr 


Thr 


105 


Gly 


Thr 


Leu 


Tyr 


Thr 


185 


Arg 


Lys 


Gln 


Ala 


Met 


265 


Cys 


Asn 


Ser 


Ile 


Tyr 


10 


Val 


Val 


Tyr 


Gly 


Asn 


90 


Asp 


Asp 


Gln 


Val 


Lys 


170 


Gln 


Asp 


Glu 


Leu 


Asn 


250 


Gln 


Thr 


Ile 


Ile 


Ile 
330 


Lys 


Thr 


Phe 


Ala 


Leu 


75 


Gln 


Ile 


Tyr 


Cys 


Gln 


155 


Arg 


Gly 


Phe 


Leu 


Gly 


235 


Lys 


Asn 


Val 


Leu 


Asp 
315 


Lys 


Gly 


Gly 


Phe 


Lys 


60 


Gly 


Thr 


Glu 


Gln 


Tyr 


140 


Asn 


Val 


Arg 


Leu 


Gly 


220 


Gly 


Leu 


Tyr 


Val 


Lys 
300 


Tyr 


Thr 


Gly 


Asp 


Asn 


45 


Arg 


Val 


Pro 


Pro 


Ser 


125 


Lys 


Gly 


Asn 


Ser 


Asp 


205 


Leu 


Leu 


Asn 


Phe 


Asp 


285 


Glu 


His 


Cys 


510 


His 


Lys 


30 


Gln 


Asn 


Asp 


Leu 


Asn 


110 


Phe 


Arg 


Ile 


Gly 


Tyr 


190 


Met 


Gln 


His 


Ala 


Val 
270 


Leu 


Tyr 


Ser 


Tyr 


Tyr 
15 

Val 
Thr 


Ile 


Val 


Tyr 


95 


Gly 


Leu 


Tyr 


Pro 


Ala 


175 


Glu 


Ser 


His 


Thr 


Lys 


255 


Leu 


Leu 


Gly 


Ile 


Pro 
335 


Asp 


Phe 


Ile 


Arg 


Thr 


80 


Arg 


Leu 


Ala 


Ser 


Leu 


160 


Phe 


Thr 


Glu 


Ile 


Val 


240 


Ser 


Ala 


Leu 


Thr 


Asn 
320 


Gln 


92 


Leu 


<210> 
<211> 
<212> 
<213> 
<220> 
<223> 


Gln 


PRT 


<400> SEQUENCE:


Ser


1


Ala 


Cys 


Val 


65 


Thr 


Asp 


Cys 


Met 


Asn 


145 


Asn 


Ser 


Thr 


Ile 


Lys 


225 


Gln 


Leu 


Leu 


Gly 


Ala 


Cys 


Leu 


Gln 


50 


Met 


Val 


Ile 


Asn 


Tyr 


130 


Asn 


Asn 


Trp 


Met 


Gly 


210 


Thr 


Thr 


Lys 


Val 


Asn 
290 


Trp 


Val 


Pro 


35 


Tyr 


His 


Leu 


Val 


Lys 


115 


Thr 


Gly 


Leu 


His 


Phe 


195 


Val 


Leu 


Ser 


Ala 


Phe 


275 


Thr 


Thr 


Met 


20 


Ser 


Leu 


Phe 


Lys 


Asp 


100 


Tyr 


Asp 


Asn 


Ala 


Glu 


180 


Cys 


Asn 


His 


Ala 


Thr 


260 


Asn 


Ser 


SEQ ID NO 13 
LENGTH: 
TYPE: 
ORGANISM: Artificial Sequence 
FEATURE: 
OTHER INFORMATION: Mutated Nsp16 sequence


302 


13 


Gly 


Ser 


Gly 


Gln 


85 


Tyr 


Lys 


Asn 


Asp 


Leu 


165 


Val 


Thr 


Tyr 


Ala 


Tyr 


245 


Pro 


Leu 


Phe 


Gly 


Pro 


Ile 


Lys 


Ala 


70 


Trp 


Val 


Thr 


Asp 


Asp 


150 


Gly 


Leu 


Ala 


Leu 


Asn 


230 


Ser 


Val 


Ile 


Thr 


93 


Tyr 
Cys 
Met 
Thr 
55 

Gly 
Leu 
Ser 
Glu 
Ser 
135 
Val 
Gly 
Tyr 
Val 
Gly 
215 
Tyr 
Ile 
Val 


Lys 


Ser 
295 


The invention claimed is: 


1. A live, attenuated coronavirus comprising a variant 
replicase gene encoding polyproteins comprising a mutation 
in one or both of non-structural protein(s) nsp-10 and 
nsp-14, wherein the variant replicase gene encodes a protein 


Asn 


Asn 


Met 


40 


Thr 


Ser 


Pro 


Asp 


His 


120 


Lys 


Phe 


Ser 


Asp 


Asn 


200 


Ala 


Ile 


Phe 


Asn 


Cys 


280 


Asp 


Met 


Ile 


25 


Asn 


Met 


Asp 


Glu 


Ala 


105 


Lys 


Arg 


Ile 


Phe 


Ile 


185 


Ala 


Ser 


Phe 


Asp 


Leu 


265 


Gly 


Ser 


Pro 


10 


Pro 


Val 


Cys 


Lys 


Gly 


90 


His 


Phe 


Lys 


Tyr 


Ala 


170 


Ala 


Ser 


Glu 


Trp 


Val 


250 


Lys 


Lys 


Phe
Glu


Asn 


Ala 


Val 


Gly 


75 


Thr 


Val 


Asp 


His 


Leu 


155 


Val 


Gln 


Ser 


Lys 


Arg 


235 


Ala 


Thr 


Leu 


Val 


94 


-continued 


Leu 


Tyr 


Lys 


Pro 


60 


Val 


Leu 


Ser 


Leu 


Glu 


140 


Ser 


Lys 


Asp 


Ser 


Val 


220 


Asn 


Lys 


Glu 


Leu 


Cys 
300 


Tyr 


Gly 


Tyr 


45 


His 


Ala 


Leu 


Val 


Val 


125 


Gly 


Ser 


Val 


Cys 


Glu 


205 


Lys 


Cys 


Phe 


Gln 


Val 


285 


Thr 


Lys Val Gln 
15 


Val Gly Ile 
30 


Thr Gln Leu 
Asn Met Arg 
Pro Gly Ser 


80 


Val Asp Asn 
95 


Leu Ser Asp 
110 


Ile Ser Asp 
Val Ile Ala 
Phe Leu Arg 


160 


Thr Glu Thr 
175 


Ala Trp Trp 
190 


Ala Phe Leu 
Val Ser Gly 
Asn Tyr Leu 


240 


Asp Leu Arg 
255 


Lys Thr Asp 
270 


Arg Asp Val 


Met 


comprising an amino acid mutation of Pro to Leu at the
position corresponding to position 85 of SEQ ID NO: 6,
and/or wherein the variant replicase gene encodes a protein
comprising an amino acid mutation of Val to Leu at the
position corresponding to position 393 of SEQ ID NO: 7.

2. The coronavirus according to claim 1 wherein the
variant replicase gene encodes a protein comprising one or
more amino acid mutations selected from:

an amino acid mutation of Leu to Ile at the position
corresponding to position 183 of SEQ ID NO: 8; and

an amino acid mutation of Val to Ile at the position
corresponding to position 209 of SEQ ID NO: 9.

3. The coronavirus according to claim 1 wherein the
replicase gene encodes a protein comprising the amino acid
mutations Val to Leu at the position corresponding to
position 393 of SEQ ID NO: 7; Leu to Ile at the position
corresponding to position 183 of SEQ ID NO: 8; and Val to
Ile at the position corresponding to position 209 of SEQ ID
NO: 9.

4. The coronavirus according to claim 1 wherein the
replicase gene encodes a protein comprising the amino acid
mutations Pro to Leu at the position corresponding to
position 85 of SEQ ID NO: 6; Val to Leu at the position
corresponding to position 393 of SEQ ID NO: 7; Leu to Ile
at the position corresponding to position 183 of SEQ ID NO:
8; and Val to Ile at the position corresponding to position 209
of SEQ ID NO: 9.

5. The coronavirus according to claim 1 wherein the
replicase gene comprises at least one nucleotide substitution
selected from:

C to T at nucleotide position 12137; and

G to C at nucleotide position 18114;

compared to the sequence shown as SEQ ID NO: 1;

and optionally, comprises one or more nucleotide substitutions
selected from T to A at nucleotide position 19047; and

G to A at nucleotide position 20139;

compared to the sequence shown as SEQ ID NO: 1.
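
By way of illustration only, the nucleotide substitutions recited in claim 5 can be written down as data and applied to, or detected in, a sequence. The sketch below is not part of the claims. The helper names (`apply_substitutions`, `find_substitutions`), the toy sequence and the assumption that positions are counted from 1 are all hypothetical choices made for the example; SEQ ID NO: 1 itself is not reproduced here.

```python
# Illustrative sketch only: helper names and the toy sequence are hypothetical.
# Positions are assumed to be 1-based, as is usual in a sequence listing.

# The four substitutions recited in claim 5, keyed by position in SEQ ID NO: 1.
# Each value is (reference base, substituted base).
CLAIMED_SUBSTITUTIONS = {
    12137: ("C", "T"),
    18114: ("G", "C"),
    19047: ("T", "A"),
    20139: ("G", "A"),
}


def apply_substitutions(reference, substitutions):
    """Return a copy of `reference` with the given point substitutions applied.

    Raises ValueError if a position is out of range or if the base found in
    the reference differs from the expected reference base.
    """
    seq = list(reference)
    for position, (ref_base, alt_base) in sorted(substitutions.items()):
        if not 1 <= position <= len(seq):
            raise ValueError(f"position {position} is outside the sequence")
        found = seq[position - 1]
        if found != ref_base:
            raise ValueError(
                f"expected {ref_base} at position {position}, found {found}"
            )
        seq[position - 1] = alt_base
    return "".join(seq)


def find_substitutions(reference, variant):
    """List every position (1-based) at which two equal-length sequences differ."""
    if len(reference) != len(variant):
        raise ValueError("sequences must have equal length")
    return {
        index + 1: (ref_base, var_base)
        for index, (ref_base, var_base) in enumerate(zip(reference, variant))
        if ref_base != var_base
    }


# Toy data standing in for SEQ ID NO: 1, which is far longer.
toy_reference = "AACTTAGGA"
toy_substitutions = {3: ("C", "T"), 7: ("G", "C")}
toy_variant = apply_substitutions(toy_reference, toy_substitutions)
```

With the full reference sequence loaded as a string, `apply_substitutions(reference, CLAIMED_SUBSTITUTIONS)` would perform the same check at the claimed positions, and `find_substitutions` could be used to compare a candidate sequence of equal length against the reference.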

6. The coronavirus according to claim 1 which is an 
infectious bronchitis virus (IBV). 

7. The coronavirus according to claim 1 which is IBV 
M41. 

8. The coronavirus according to claim 7, which comprises
an S protein at least part of which is from an IBV serotype
other than M41.

9. The coronavirus according to claim 8, wherein the S1 
subunit is from an IBV serotype other than M41. 

10. The coronavirus according to claim 8, wherein the S 
protein is from an IBV serotype other than M41. 

11. The coronavirus according to claim 1 which has 
reduced pathogenicity compared to a coronavirus expressing 
a corresponding wild-type replicase, wherein the virus is 
capable of replicating without being pathogenic to the 
embryo when administered to an embryonated egg. 




12. A variant replicase gene as defined in claim 1. 

13. A protein encoded by a variant coronavirus replicase 
gene according to claim 12. 

14. A plasmid comprising a replicase gene according to 
claim 12. 

15. A method for making the coronavirus according to 
claim 1 which comprises the following steps: 

(i) transfecting a plasmid according to claim 14 into a host
cell;

(ii) infecting the host cell with a recombining virus
comprising the genome of a coronavirus strain with a
replicase gene;

(iii) allowing homologous recombination to occur
between the replicase gene sequences in the plasmid 
and the corresponding sequences in the recombining 
virus genome to produce a modified replicase gene; and 

(iv) selecting for recombining virus comprising the modi- 
fied replicase gene. 

16. The method according to claim 15, wherein the
recombining virus is a vaccinia virus.

17. The method according to claim 15 which also includes 
the step: 

(v) recovering recombinant coronavirus comprising the 
modified replicase gene from the DNA from the recom- 
bining virus from step (iv). 

18. A cell capable of producing a coronavirus according
to claim 1.

19. A vaccine comprising a coronavirus according to 
claim 1 and a pharmaceutically acceptable carrier. 

20. A method for treating and/or preventing a disease in 
a subject which comprises the step of administering a 
vaccine according to claim 19 to the subject. 

21. The method of claim 20, wherein the disease is 
infectious bronchitis (IB). 

22. The method according to claim 20 wherein the method 
of administration is selected from the group consisting of:
eye drop administration, intranasal administration, drinking 
water administration, post-hatch injection and in ovo injec- 
tion. 

23. The method according to claim 21 wherein the admin- 
istration is in ovo vaccination. 

24. A method for producing a vaccine according to claim 
19, which comprises the step of infecting a cell according to 
claim 18 with a coronavirus according to claim 1. 

25. The coronavirus according to claim 1, further com- 
prising a mutation in one or both of nsp-15 and nsp-16. 


* * * * * 


